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Sir; 

DECLARATION OF SHI-YOU DING, Ph.D 
37 C.F.R §1.132 

1. I am the Shi-You Ding who is named as an inventor in the above- 
identified application. 

2. Exhibit A to this Declaration is my curriculum vitae. I am currently 
employed at the National Renewable Energy Laboratory in Golden, Colorado where I 
work as a Senior Research Scientist. I have many years of bench research experience in 
the field of microbiology with focus in hydrolyzing enzymes. 

3. This declaration is provided with evidence that refutes certain of the 
Examiner's arguments in the Office Action dated January 10, 2003. 

4. The first issue is to address the arguments beginning on page 7 and which 
concern the rejection of claims 1-5 and 12-13 because the specification allegedly does not 
enable claims this broad. For the reasons explained below, this cannot be the case where 
the claims identify well known families of domains. 

5. Exhibit B to this Declaration contains an excerpt of information that is 
published in the CAZy database (cited as Coutinho, P.M. & Henrissat, B. (1999) 
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Carbohydrate-Active Enzymes server at URL: http://afmb.cnrs- 
mrs.fr/-cazy/CAZY/index.html). CAZy was produced over a ten year period and 
contains cross-linkages to a variety of international sequence databases. The "Family 
GH5" domains extend from pages 1 to 13 of Exhibit B. The Family CBDIII domains 
extend from pages 16 to 18. The "Family CBDII domains extend from pages 19-22. 

6. Scientists who practice in my field are routinely trained in the use of 
computer databases, such as the CAZy, GenBank, and SwissProt databases identified in 
Exhibit B. The mere mention that a particular polypeptide domain is classified in of one 
of these families causes an instant recognition that the domain is in a similar 
classification with respect to other domains in that family. The ability to access domains 
by this classification system is a fundamental and basic part of our working knowledge. 

7. Therefore, it is simply not correct for the Examiner to say that the 
specification fails to disclose the multiple modifications encompassed by the instant 
claims. Multiple modifications are, in fact, disclosed by the domain family names. 
Those skilled in the art appreciate from the disclosure in the specification that these well 
known family domains may be linked together in perhaps thousands of permutations of 
GH5-CBDIII-CBDII domains. Furthermore, molecular genetics are sufficiently 
advanced that those following the disclosure of the present specification would routinely 
be able to make such linkages. 

8. It follows that the allegations raised as points A through E in the passage 
bridging pages 8-9 are false. The specification does disclose a specific guidance and 
rationale for practicing the claimed invention when it refers to the domain family names. 

9. Another issue is that on page 1 5 of the Office Action the Examiner has 
shifted the burden to Applicants to prove that Gibbs et al., Appl. Environ. Microbiol. 
58(12) 3864-3867 (1992) does not contain linked GH5-CBHIII-CBHII domains. 

10. Exhibit C to this Declaration is an excerpt from the Swiss Prot database, 
and it contains a summary of the sequence reported by Gibbs. The middle portion of 
page 2 thereof contains a summary of the domains. 
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11. Ibavy studied Exhibit C t classify the domains identified on page 2 
thereof. Handwritten comments of P GH5," "CBDII," and "CBDIP indicate my 
conclusion that Qibfcs teaches a sequence of GHS-CBDH-CBDII family domains. This ij, 
not the sequence of (jHS-CBDIII-CBDH that is presently recited in claim 1 . 

12. I hereby declare that all statements made herein of my own knowledge ar 2 
true and that all statements made on information and belief are believed to be true; an>i 
further that these statements were made with the knowledge that willful false statements 
and the like are punishable by fine or imprisonment, or both, under 18 U.S,C. § 1001, ajvj 
that such willful felsa statements may jeopardize the validity of the application or anf 
patent issued therec ru 
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SHI-YOU DING ph.o 



National Bioenergy Center 

National Renewable Energy Laboratory 

1617 Cole Blvd. Golden, CO 80401 



Phone: 303-384-7758 

Fax: 303-384-7752 

E-mail: shi_you_ding@nrel.gov 



EDUCATION 



Postdoctoral fellow Jhe Weizmann Institute of Science & Tel Aviv University, Israel 
PAi.D, Peking University & The Chinese Academy of Sciences, China 
M.S. The Chinese Academy of Sciences, China 
B.Sc. Anhui Normal University, China 



1997-2000 



1994 
1989 
1986 



MEMBERSHIPS 

American Society of Microbiology, American Chemical Society 



PROFESSIONAL EXPERIENCE 



Scientist II, National Renewable Energy Laboratory 
Research associate, National Renewable Energy Laboratory 



2002-present 
2000-2001 



Nanotechnology. Understanding Nanoscale Chemistry: Quantum Dot Interactions with Proteins and Carbon Nanotubes 
and the Mediation of Inter-Quantum Dot Communication. 

❖ Cloned and expressed Clostridia cellulosomal scaffoldin proteins 

❖ Making protein/quantum dot conjugates 

❖ Making quantum dots arrays by scaffoldin protein templates. 

Cellulase Fundamentals. Biochemistry and molecular biology of enzyme system of cellulytic organism 

❖ Cloned, sequenced four Glycoside Hydrolase genes from A cidothermus cellulolyticus. Four ROIs have been filed. 

❖ Developed E. coli system for Acidothermus enzyme expression. Six catalytic domain of Acidothermus enzymes have 
been expressed and purified. 

❖ Set up sequence database of CBHs (Family 7 cellulase), aligned the sequences of all available Family 7 cellulase, 
designed degenerate PCR primers for screening new CBHs from potential microorganisms. 

❖ Assisted to identify new CBHs found from Penicillium sp. and other cellulose degraders, designed PCR primers for 
further gene cloning. 

CBHI Expression. Evaluate the CBHI expression in different system, e.g. E. coli, fungi and insect cells. 

❖ Cloned the catalytic domain of T. reesei CBHI into different expression vectors, including pET22b, pET32b, pET40b, 
pMAL-p2, generated ten CBHI-cd expression constructs, with or without signal sequences, fusion proteins, and tags. 

❖ Expressed CBHI constructs in E. coli strains, including XL-lblue, BL21, BL21(DE3), BL21 trx (DE3), 
BL21(DE3)pLysS, Origami(DE3), Origami(DE3)pLysS, tested the expression level. 

❖ Cloned CBHI-cd into a E. coli -fungi shuttle vector pPFEl with or without 6xHis tag, prepared the constructs for 
expression CBHI-cd in fungi. 

Co-PI proposals 

❖ DOE Office of Science National Nanotechnology Initiative: Understanding Nanoscale Chemistry: Quantum Dot 
Interactions with Proteins and Carbon Nanotubes and the Mediation of Inter-Quantum Dot Communication. PI: Arthur 
J. Nozik 

* This proposal has been funded at a level of $750k/vr. 

❖ NASA OLMSA Microgravity Research Division: Self-assembling Proteins: new Superlattice Structures for 
Efficient Electro-optic Devices. PI: Mike E. Himmel. 

❖ DOE Office of Science Microbial Cell Project: Toward Understanding the Clostridial Cellulosome: A 
Bioinforrnatics Approach. PI: Mike E. Himmel. 

❖ DDRD: Preparation and Characterization of Novel Quantum Dot/Protein Conjugates. PI: Mike E. Himmel & Garry 
Rumbles. 



DOE Microbial Cell Project Review Panel (June 2001), 



Postdoctoral fellow, The Weizmann Institute of Science & Tel Aviv University, Israel 1997-1999 

❖ Identified high-molecular-weigh (200-400 kDa) cellulosome-related proteins, including scaffolding proteins and cell 
surface anchoring proteins, from different cellulolytic bacteria. 

❖ Constructed genomic libraries. Five genes encoding cellulosome-related protein, 6-12kb in size, were cloned from 
Acetivibrio celiulolyticus, Bacteroides cellulosolvens, and Ruminococcus flavefaciens (collaborated with Dr. Harry J. 
Flint, Rowett Research Institute, UK), respectively. 

❖ Expressed and purified three functional domains. 

❖ Crystallized and determined the crystal structure of CBD from Clostridium cellulolyticum scaffolding protein 
(collaborated with Dr. Linda Shimon et al. at The Weizmann Institute of Science). 

❖ Crystallization of cohesin domains from B. cellutosolvens, Ruminococcus flavefaciens, and co-crystal of cohesin 
domain of scaffoldin and dockerin domain of catalytic summit from C. thermocellum are still under processing. 

Assistant Investigator, Peking University, China 1995-1996 

❖ Cloning, and expression of plant resistance genes and molecular biology of Rice Dwarf Virus. 

❖ DNA fragments isolation from ancient materials. 

❖ Molecular evolution and systematics of Astragalinae. 

❖ Constructed the local molecular biological database and the Homepage of the laboratory 

Visiting Research Fallow, Peking University 1 993-1 994 

❖ Molecular phylogenetic study on Astragalinae 

Research Assistant, The Chinese Academy of Sciences 1 989-1 994 

❖ Ecological investigation in wild field, specimen identification in herbarium 

❖ Cladistic analysis of Astragalinae based on morphological and ultrastructural data. 

❖ Ecological, floristic investigation of plants in Northern China. 

❖ Microscopic and ultrastructural study on the anatomy of Staphyleaceae 

GRANT & FELLOWSHIP 

❖ Co-PI (2000-2004): DOE Office of Science National Nanotechnology Initiative. Understanding Nanoscale 
Chemistry: Quantum Dot Interactions with Proteins and Carbon Nanotubes and the Mediation of Inter-Quantum Dot 
Communication (PI: Arthur J. Nozik) 

❖ PI/co-PI (1991-1997): National Natural Science Foundation of China. 
PI:: Molecular systematics of subtribe Astragalinae (Fabaceae), 
Co-PI: Ancient DNA research (PI: Zhangliang Chen) 

Co-PI: Systematics of Polygonatum (PI: Guangyun Rao) 

❖ PI/co-PI (1991-1995): other projects 

NLPGE Research Fellowships. PI: Phylogenetic Reconstruction of Subtribe Astragalinae: Evidence from Restriction 
Site Mapping of A PCR-amplified Fragment Which Encompassed cpDNA genes ndhF and psbA. 
KIB Director Fellowships. PI: Systematics of Astragalinae 

RECORD OF INVENTION 

❖ Michael E. Himmel, Shi- You. Ding. Garry Rumbles and Arthur J. Nozik Generation of Ordered Arrays of Quantum 
Dots Using Cellulosome Protein Scaffolds. NREL New Invention No. 01-05. 

❖ Shi- You Ding. William S. Adney, Todd B. Vinzant, Michael E. Himmel A Thermal Tolerant Avicelase from 
Acidothermus celiulolyticus, NREL Invention Disclosure #01-36, Filed 8/1/01 

❖ William S. Adney, Shi- You Ding . Suzanne McCarter, Michael E. Himmel, Stephen R. Decker, and Todd B. Vinzant. 
A Thermal Tolerant Exoglucanase horn Acidothermus celiulolyticus. NREL Invention Disclosure #01-38, Filed 8/1/01 

❖ Shi- You Ding. William S. Adney, Todd B. Vinzant, Michael E. Himmel. A Thermal Tolerant Multi-Domain Cellulase 
from Acidothermus celiulolyticus. NREL Invention Disclosure #01-37, Filed 8/1/01 

❖ Shi-Y uDing . William S. Adney, Todd B. Vinzant, Michael E. Himmel. A Thermal Tolerant Mannanase from 
Acidothermus celiulolyticus. NREL Invention Disclosure #01-35, Filed 8/1/01 



❖ Michael E. Himmel, Joshua Sakon, William S. Adney, Stephen R. Decker, Suzanne McCarter, Shi- You Ding, Todd 
B. Vinzant and John O. Baker. Site Specific Mutations Conferring Thermal Tolerance to Trichoderma reesi CBHI. 
NREL New Invention No. 01-3 1 . 



PUBLICATIONS 



1. Xu, Q., Gao, W., Ding, S.-Y . Kenig, R., Shoham, Y., Bayer, E. A., Lamed, R. 2003. A cellulosome-integrating gene cluster in Acetivibrio 
cellulolyticus contains a scaffbldin-like gene encoding for a novel type of adaptor protein. J. Bacterid, (in press) 

2. Ding, S.-Y ., Lamed, R., Bayer, E. A., Himmel, M. E. 2003. The bacterial scaffoldin: structure function and potential applications in the 
nanosciences. In Genetic Engineering (J. K. Setlow, ed.) Volume 25, Kluwer Academic/Plenum Publishers, (in press) 

3. Ding S.-Y., William S. Adney, Todd B. Vinzant Stephen R. Decker, John O. Baker, Steven R. Thomas, Michael E. Himmel. Glycosyl 
Hydrolases Gene Cluster of Acidothermus cellulolyticus ACS (in press) 

4. Rincon M. T., Ding, S.-Y .. Mccrae S. I., Martin J. C, Aurilia V., Lamed R., Shoham Y., Bayer, E. A., Flint, H. J. 2003. Novel 
organization and divergent dockerin specificities in the cellulosome system of Rominococcus flavefaciens. J. Bacteriol. 185:703-713. 

5. William S. Adney, Yat-Chen Chou, Stephen R. Decker, Ding, S.-Y., John O. Baker, Todd B. Vinzant, Glenna Kunkel, and Michael E. 
Himmel. Heterologous Expression of Trichoderma reesei 1 ,4-beta-D-glucan cellobiohydrolase. ACS (in press) 

6. Ding, S-Y .. Rincon M. T., Lamed R, Martin J. C, Mccrae S. I., Aurilia V., Shoham Y., Bayer, E. A., Flint, H. J. 2001. Cellulosomal 
Scaffoldin-Like Proteins from Rominococcus flavefaciens. J. Bacteriol. 183:1945-1953. 

7 - Ding, S-Y . Bayer, E. A., Steriner, D., Shoham, Y. and Lamed, R. 2000. An atypical scaffoldin of the Bacteroides cellulosolvens 
cellulosome that contains eleven type-II cohesins. J. Bacteriol. 182:4915-4925 

8. Bayer, E. A., Ding, S-Y . Shoham, Y. and Lamed, R. New perspectives in the structure of cellulosome-related domains from different 
species. In Genetics, biochemistry and ecology of cellulose degradation (K. Ohmiya et al., ed.), 1999. pp. 428-436. Uni Publishers Co., 
Ltd., Tokyo, Japan. 

9. Ding, S-Y. Bayer, E. A., Steriner, D., Shoham, Y. and Lamed, R. 1999. A Novel Cellulosomal Scaffoldin from Acetivibrio cellulolyticus 
That Contains a Family 9 Glycosyl Hydrolase. J. Bacteriol. 181:6720-6729. 

10. Bayer, E. A., Ding, S-Y .. Mechaly, A., Shoham, Y. and Lamed, R. 1999. Emerging phylogenetics of cellulosome structure, p. 189-201. In 
H. Gilbert, G. J. Davies, B. Henrissat, and B. Svensson (eds), Recent advances in carbohydrate bioengineering. The Royal Society of 
Chemistry, Cambridge. 

11. Ding, S-Y .. Gu, H., Qu, L., Chen, Z. A preliminary study on the use of RFLP analysis of the PCR amplified products in the systematic 
investigation of the subtribe Astragalinae (Fabaceae). Acta Bot. Sin. 1995, 7-102. 

12. Ding, S-Y .. Zhang, C, Gu, H., Chen, Z. Progresses of studies on plant systematics at DNA level. Acta Bot. Boreal.-Occident. Sin. 
1996,16(3):578-588. 

13. Ding, S-Y .. Zhang, C, Gu, H., Chen. Z. General review of the methods of molecular systematics of plants at DNA level. Acta Bot. 
Boreal.-Occident. Sin. 1996,16(2): 197-202. 

14. Chu, L., Li, Y., Quan, S., Ding, S-Y .. Suzuki, R, Chen, Z. Sequence analysis and expression in E. coli of RDV S7. Acta Microbiologica 
Sinica, 1996.36(5):335-343. 

15. Ding, S-Y .. Zhang, C, Gu, H., Chen, Z. Plant systematics at DNA level, in The Proceeding of the Symposium of the Postdoctor in China. 
1996, Pp. 938-943. 

16. Chen, Y., Zhang, C, Ding, S-Y . Zhang, Z. The taxonomy and analysis of Artemisia Linn. In the Loess Plateau of northern Shaanxi. Acta 
Bot. Boreal.-Occident. Sin. 1993,13(3):238-245. 

17. Ding, S-Y .. Yu, Z. Systematic signification of leaf structure of Staphyleaceae. Bull. Bot. 1992, 12(2): 177-1 84. 

1 8. Ding, S-Y .. Yu, Z. The preliminary study on chemical taxonomy of Staphyleceae. The 1st Symposium of Systematic Botany in 
Northwestern China. 1992,108-111. 

19. Zhang, Z., Zhang, C, Chen,Y., Ding, S-Y . A preliminary study in the border line of the floristic regionalization in the Loess Plateau of 
northern Shaanxi. ActaBol. Boreal.-Occident. Sin. 1992,12(4):303. 

20. Zhang, Y.. Ding, S-Y . A resource of nectariferous plants in Shaanxi. Acta Bot. Boreal. -Ocedent. Sin. 1991,185-189. 

21 . Ding, S-Y .. Zhang, Y. The conservation plants in northwestern China, in Ecological Environment in Northwestern China. 1991,1 :353- 
364. 

22. Zhang, Z., Zhang, C, Ding, S-Y . Plants in the Loess Plateau. In, The Resource and Application of Plants in the Loess Plateau. 1991,194- 
287. 

23. Ding, S-Y .. Yu, Z. Study on the pollen of Staphyleaceae. Acta Bot. Boreal. -Ocedent. Sin. 1988,8(5): 29-33. 



Abstracts and presentations in international meetings 

1 . Shi-You Ding. Todd B. Vinzant, William S. Adney, Stephen R. Decker, John O. Baker, Ed Jennings, Michael E. Himmel New Glycosyl 
Hydrolases from Acidothermus cellulolyticus poster for 24 th Symposium on Biotechnology For Fuels and Chemicals April 28 - May 1 , 
2002, Gatlinburg, TN 

2. S.-Y. Ding . WS Adney, SE McCarter, JO Baker, SR Decker, TB Vinzant, ME Himmel. Diversity and Conservation of 
Relevant Glycoside Hydrolase Families in Well-studied Cellulolytic Microbes. Proctor Academy, Andover, New Hampshire, 
USA, July 29- Aug.3, 2001. 

3. Marco Rincon, Shi-You Ding. Sheila McCrae, Jennifer Martin, Ed Bayer, Raphael Lamed, Yuval Shoham, Vincenzo Aurilia, and Harry 
Flint. Cellulosomal proteins assembly in rumen cellulolytic bacterium Ruminococcus flavefaciens. Proctor Academy, Andover, New 
Hampshire, USA, July 29- Aug.3, 2001 . 

4. Ding, S-Y . cellulosomes in Acetivibrio cellulolyticus and Bacteroides cellulosolvens. Gordon Research Conference on Cellulases and 
Cellulosomes, Proctor Academy, Andover, New Hampshire, USA, July 25-30, 1999. (Oral presentation) 



5. Ding. S-Y ., Bayer, E. A., Shoham, Y. and Lamed, R., McCrae, S. I., Kirby, J., Aurilia, V., and Flint, H. J. Preliminary evidence of high- 
molecular-weight scaffoldin-like proteins from Ruminococcus flavefaciens. 3 rd Carbohydrate Bioengineering Meeting, University of 
Newcastle Upon Tyne, UK, 1 1 th - 14* April 1999. P5.12 

6. Ding, S-Y ., Steiner, D., Kenig, R., Yaron, S., Morag, E., Shoham, Y., Bayer, E., Lamed, R. Evidence for cohesin and CBD domains, 
characteristic of cellulosomes in non-clostridial anaerobic bacteria. 8* International Symposium on the Genetics of Industrial 
Microorganisms, Jerusalem, Israel, June 28-July 2,1998. p50. 

7. Ding, S-Y ., Steiner, D., Kenig, R., Yaron, S., Morag, E., Shoham, Y., Bayer, E., Lamed, R. Domain organization and sequence of the first 
non-dostridial scaffoldin from Acetivibrio cellulolyticus. Mie Bioforum, Genetics, biochemistry and ecology of cellulose degradation, 
Suzuka, Japan, Sept. 7-1 1, 1998. p47. 

8. Ding. S-Y .» Steiner, D Kenig, R Yaron, S., Morag, E Shoham, Y Bayer, E Lamed, R. Unique Cellulosome-ralated protein in Bacteroides 
cellulosovens. Mie Bioforum 98, Genetics, biochemistry and ecology of cellulose degradation, Suzuka, Japan, Sept. 7-11, 1998. p49. 
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CAZy - Carbohydrate-Active 

enZYmes 



1X Access by 
Home „ J 
Family 



Access by 
Organism 



Family GH5 



Acknowledgements Links Team CitingCAZy Search 



Family GH5 



CAZy Family 
Known Activities 



Glycoside Hydrolase Family 5 

Endoglycosylceramidase (EC 3.2.1.123); P-mannosidase (EC 3.2.1.25); Cellulase (EC 
3.2.1.4); Glucan 1,3-P-glucosidase (EC 3.2.1.58); Licheninase (EC 3.2.1.73); Glucan 
endo-1 ,6-P-glucosidase (EC 3.2.1.75); Mannan endo-1 ,4-P-mannosidase (EC 3.2.1.78); 
Endo-1,4-P-xylanase (EC 3.2.1.8); Cellulose 1 ,4-P-cellobiosidase (EC 3.2.1.91). 



Mechanism Retaining 



Catalytic 
Nucleophile/Base 

Catalytic Proton Donor 

3D Structure Status 

Clan 
Note 

Relevant Links 
Statistics 



Glu (experimental) 

Glu (experimental) 

Available (see PDB). Fold ( P / oc ) 8 

GH-A 

formerly known as cellulase family A. 
HOMSTRAD; InterPro; PFAM; PROSITE 

CA2y(363); GenBank/GenPept (487); Swissprot (177); PDB (43); 3D(13); cryst(1) 



Protein 

endo-1 ,4-glucanase 

endo-1 ,4-glucanase 
xylanase 
xylanase D 

P-mannanase 
exo-1,3-glucanase 1 
mannanase (Cel4A) 

exo-1,3-glucanase2 
(fragment) 

ORF PCZA361.14 

endo-1 ,4-glucanase CelB 

endo-1 ,4-glucanase CelD 

ORF At1g02310yT6A9.1 



Organism 



EC# 
3.2.1.4 



GenBank / GenPept 



Acidothermus 
ceflulolyticus 
Actinomyces sp. 40 
Aeromonas caviae 
Aeromonas caviae 
Agaricus bisporus 
Agaricus bisporus 
Agahcus bisporus C54 
carb8 

Agaricus bisporus Horst 3.2.1 .58 
U1 

Amycolatopsis orientalis 
Anaerocellum 
thermophilum 
Anaerocellum 
thermophilum 

Arabidopsis thaliana n.d. 



3.2.1.4 
3.2.1.8 
3.2.1.8 
3.2.1.78 
3.2.1.58 
3.2.1.78 



n.d. 
3.2.1.4 

3.2.1.4 



U33212 
AX467594 
U94825 
D88553 
U86340 
Z50095 
X92961 
AJ271862 



AAA75477.1 
CAD42489.1 
AAC06196.1 
BAA13641.1 
AAB63573.1 
CAA90423.1 
CAA63536.1 
CAB76904.1 



SwissProt PDB / 3D 
P54583 



1C0DA. 
1ECEA 



066064 
P70733 
024852 
Q92401 

Q12539 

Q9P893 



X92961 CAA63537.1 Q12540 

AJ223998 CAA1 1771.1 052801 

Z86104 CAB06784.1 P96310 

Z77855 CAB01 405.1 Q591 54 



AC064879 AAG00883.1 
AY081352 AAL91241.1 
BT002154 AAN72165.1 
NC_003070NP 171733.1 



Q9FZ29 



http://afinb.cnrs-mrs.fr/CAZY/GH 5.html 
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ORF At1g13130/F3F19.15 


Arabidopsis thaliana 


n.d. 


AC007357 AAD31066.1 
NM_101183NP_172772.1 


ORF At2g20680 


Arabidopsis thaliana 


n.d. 


AC006234 AAD20927.1 


ORF At3g10890/T7M13.3 


Arabidopsis thaliana 


n.d. 


AC011708 AAF19560.1 
NMJ 11926 NP J 87700.1 


ORF At3g10900fl7M13.2 


Arabidopsis thaliana 


n.d. 


AC011708 AAF1 9559.1 
NMJ 1 1927 NPJ87701.1 


ORFAt3g26130/MTC11.2 


Arabidopsis thaliana 


n.d. 


AB024038 BAB02432.1 
NC_003074NP_1 89244.1 


ORF At3g26140/MTC11.4 


Arabidopsis thaliana 


n.d. 


AB024038 BAB02434 1 
NC_003074NP _1 89245.1 


ORF At3g30540/MQP15.4 


Arabidopsis thaliana 


n.d. 


NC_003074 NP_1 89675.1 


ORF At4g28320 


Arabidopsis thaliana 


n.d. 


AL161572 fTAR7Qfi14 1 
NMJ 18972 NP_1 9456 1.1 


ORF 


Arabidopsis thaliana 


n.d. 


BT005831 AA064766 1 


At5g01 930/7201.1 5_200 






AL1 62351 CAB82763.1 
NMJ 20271 NPJ 9581 3.1 


ORF At5g16700/F5E19_40 


Arabidopsis thaliana 


n.d. 


AL391 147 C AC 01834. 1 
NMJ21675 NPJ97172.1 


ORF At5g17500/K3M16_70 


Arabidopsis thaliana 


n.d. 


AL391 1 50 CAC01 894.1 

NMJ 21 756 NPJ 97252.1 


ORF At5g66460/K1F13.12 


Arabidopsis thaliana 


n.d. 


BT000452 AAN17429.1 
AB013389 BAB10922.1 


P-mannanase 






NMJ26044 NP_201447.1 


Aspergillus aculeatus 


3.2.1.78 


L35487AAA67426.1 


endoglucanase li 


Aspergillus aculeatus 


n.d. 


AF543446AAN16396.1 
AY1 73077 AAN7501 9.1 


FII-CMCase 


Aspergillus aculeatus 


3.2.1.4 


AB015510BAA29030.1 


endo-1 ,4-glucanase V 


Aspergillus aculeatus 
WO 94/14953 


3.2.1.4 


AF054512AAC08587.1 


endoglucanase A (EglA) 


Aspergillus kawachii 


n.d. 


AB055431 BAB62317 1 


endoglucanase C (EglC) 


Aspergillus kawachii 


n.d. 


AB055433 BAB62319.1 


endo-1, 4-glucanase B 


Aspergillus niger CBS 


3.2.1.4 


AJ224452 AAE60102.1 


(EglB) 


120.49 




AAE84292 1 
CAA1 1965.1 


endoglucanase (Engl) 


Aspergillus niger 


3.2.1.4 


AF331518AAG50051.1 




IF031125 




unnamed protein product 


Aspergillus oryzae 


n.d. 


AX011537 CAC07551.1 


ORF egl 


Azorhizobium 
caulinodans ORS571 


n.d. 


Z48958 - 


endo-1 ,4-glucanase Cel5A CelSA 


Bacillus agaradherans 


3.2.1.4 


AF067428 AAC19169.1 



endoglucanase A (EngA) Bacillus n.d. AF363635 AAL99668.1 

amyloliquefaciens 
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Q9SKU9 

Q9SG94 

Q9SG95 
Q9LTN0 
Q9LTM8 
Q9LW44 
Q9M0H6 
Q9LZV3 



Q9FJZ3 



Q00012 



074169 
059951 



074706 



085465 1A3H_ 
1E5JA 
1H11 A 
1H2J A 
1H5VA 
1HF5A 
1HF6A 
1HF7A 
10CQ 
1QH2A 
1QI0A 
1QI2A 
2A3H_ 
3A3H__ 
4A3HA 
5A3HA 
6A3HA 
7A3HA 
8A3HA 



2_ 

4/10/2003 



Page 3 of 14 



P-mannanase 
ORF BH0603 

endo-1 ,4-glucanase 3a 
endo-1 ,4-glucanase B 

endo-1 ,4-glucanase 
unnamed protein product 

unnamed protein product 
endo-1 ,4-glucanase 

endo-1, 4-glucanase 
endo-1 ,4-glucanase 
CelS 

endo-1 ,4-glucanase A 
cellulase 



endo-1 ,4-glucanase 
endo-1 ,4-glucanase 

endo-1 ,4-glucanase 
endoglucanase N252 
(Egl252) 

cellulase 

endo-1 ,4-glucanase A 
endo-1 ,4-glucanase B 
endo-1 ,4-glucanase C 
endo-1 ,4-glucanase B1 
cellulase (CelA) 
endo-1 ,4-glucanase 
endo-1 ,4-glucanase 
endo-1 ,4-P-glucanase 
endo-1 ,4-glucanase 

endo-1 ,4-glucanase (EglS 
or BgIC) 



ORF YnfF 

endo-1, 4-glucanase 
BT3043 



BL0157 
BL1333 
BL1761 

,3-glucanase 
blr3367 



UMAS1002 
Bacillus circulans K-1 

Bacillus halodurans C- 
125 

Bacillus lautus 
Bacillus lautus 

Bacillus polymyxa 
Bacillus sp. 

Bacillus sp. 
Bacillus sp. 1139 

Bacillus sp. 22-28 
Bacillus sp. 5H 
Bacillus sp. 79-23 
Bacillus sp. BP-23 
Bacillus sp. CBS 670.93 

Bacillus sp. D04 
Bacillus sp. KSM-635 

Bacillus sp. KSM-64 
Bacillus sp. KSM-N252 

Bacillus sp. KSM-S237 
Bacillus sp. N-4 
Bacillus sp. N-4 
Bacillus sp. N-4 
Bacillus sp. N186-1 
Bacillus sp. NBL420 
Bacillus subtiiis 
Bacillus subtiiis BSE616 
Bacillus subtiiis CHZ1 

Bacillus subtiiis DIG 
Bacillus subtiiis subsp. 
subtiiis str. 168 



Bacillus subtiiis subsp. 
subtiiis str. 168 
Bacteroides ruminicola 
Bacteroides 
thetaiotaomicron VPI- 
5482 

Bifidobacterium longum 
NCC2705 

Bifidobacterium longum 
NCC2705 

Bifidobacterium longum 
NCC2705 
Blumeria graminis 

Bradyrhizobium 
japonicum USDA110 



3.2.1.78 
n.d. 

3.2.1.4 
3.2.1.4 

3.2.1.4 
n.d. 

n.d. 
3.2.1.4 

3.2.1.4 
3.2.1.4 
3.2.1.4 
3.2.1.4 
3.2.1.4 

3.2.1.4 
3.2.1.4 

3.2.1.4 
3.2.1.4 

3.2.1.4 
3.2.1.4 
3.2.1.4 
3.2.1.4 
3.2.1.4 

n.d. 
3.2.1.4 
3.2.1.4 
3.2.1.4 

3.2.1.4 
3.2.1.4 



AB007123 BAA25878.1 



n.d. 

3.2.1.4 
n.d. 

n.d. 

n.d. 

n.d. 

3.2.1. - 
n.d. 



AP001509 
NC_002570 
A28175 
M33762 
A28172 
M33791 

AX665431 
AX643592 
M15743 
D00066 
D85236 
AB016164 
AF045482 
Y12512 
A76937 
A79880 
AX665429 
U27084 
M27420 



BAB04322.1 
NP_241 469.1 
CAA01935.1 

AAA22408.1 
CAA01 934.1 
AAA22631.1 

AAE86309.1 

CAD82874.1 

CAD67813.1 

AAA22305.1 
BAA00045.1 
BAA12744.1 
BAA31 712.1 
AAC02536.1 
CAA73113.1 
CAB58698.1 
CAB59165.1 
CAD82873.1 
AAC43478.1 
AAA22304.1 



M84963AAA73189.1 
AB057671 BAB62295.1 



AB018420 
M14781 
M 14729 
M25500 
233876 

AY039744 
M28332 
D01057 

AY044252 

M16185 
AF355629 
X04689 
X67044 
Z29076 
Z73234 
299113 
NC_000964 

Z73234 
299113 
M38216 
AE016938 
NC_004663 

AE014632 
NC_004307 

AE014762 
NC_004307 

AE014810 
NC_004307 

AF317734 

AP005947 
NC 004463 



BAB1 9360.1 

AAA22301.1 
AAA22299.1 
AAA22306.1 
CAA83942.1 
AAK73277.1 
AAA22307.1 
BAA00859.1 
AAK94871.1 
AAA22496.1 
AAK39540.1 
CAA28392.1 
CAA47429.1 
CAA82317.1 
CAA97610.1 
CAB13696.1 
NP_389695.1 
CAA97612.1 
CAB13698.1 
AAA22909.1 

AA078149.1 

NP_81 1955.1 

AAN24012.1 

NP_695376.1 

AAN25133.1 

NP_696497.1 

AAN25544.1 

NP_696908.1 
AAL26905.1 

BAC48632.1 
NP 770007.1 



066185 

Q9KF82 

P23550 
P23548 

P06564 

Q45554 
083012 
052731 
008342 



Q45070 
Q44878 



1LF1 A 



Q45430 

P19424 1G01A 
1G0CA 

Q59241 



P06566 
P06565 
P19570 
Q59232 

Q45532 
P23549 

P07983 
P10475 
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endo-1 ,4-glucanase A 


Butyrivibrio fibrisolvens 
A46 


3.2.1.4 


M37031 AAA20893.1 


P22541 


endo-1,4-glucanase 1 


Butyrivibrio fibrisolvens 


3.2.1.4 


X17538CAA35574.1 


P20847 




H17c 








P-1,4-mannanase (ManA) 


Caldibacillus 


3.2.1.78 


AF1 63837 AAF22274.1 


Q9RFX5 


cellulovorans 








P-mannanase/endo-1 ,4- 


Caldicellulosiruptor 


3.2.1.4 


L01 257 AAA71 887.1 


P22533 


glucanase 


saccharolyticus 


3.2.1.78 


M36063AAA72861.1 




endo-1,4- 


Caldicellulosiruptor 


3.2.1.91 


X13602 CAA31 936.1 


P10474 


giucanase/cellobiohydrolase 


saccharolyticus 


3.2.1.4 






CelB (multidomain protein) 


Caldicellulosiruptor sp. 


n.d. 


AF078737AAD30364.1 


Q9X3P6 




Tok7B.1 








exo-1,3-glucanase 


Candida albicans 


3.2.1.58 


X56556 CAA39908.1 


P29717 


t ,o-r'-giucanase (caoI) 


wal IKJILIO KJIOWfJI IliO 


n rl 
11. u. 


nrOooOUD AANIa i4DSj.1 




UKr uOUoUl 


Caulobacter crescentus 


n.d. 


AE005756 AAK22786.1 


Q9AA10 




CR15 








onHrt-1 A.nlnr9naco Pi 
ollUU I ,*r*-yiU0aI IdotJ Lf 


oe/OiS Ksetiuiouionas Ttmi 


O O A A 


1 AOCit A AAA OOAOn A 

L02544 AAA23089.1 


P50400 


endo-1 ,4-glucanase B (cfIB) 


Cellulomonas flavigena 


3.2.1.4 


AF1 72345 AAD48494.2 




endo-b1,4-mannanase 5A 


Man5A Ceilvibrio japonicus 


n.d. 


AY1 87031 AA031 759.1 




endo-b1,4-mannanase 5B 


Man5B Ceilvibrio japonicus 


n.d. 


AY1 87032 AA031 760.1 




endo-b1,4-rriannanase 5C 


Man5C Ceilvibrio japonicus 


n.d. 


AY1 87033 AA031 761.1 




cellodextrinase C 


Ceilvibrio japonicus 


3.2. 1.- 


X61299 CAA43597.1 


P27033 




(formerly Pseudomonas 










cellulose) 








endo-1,4-glucanase E 


Ceilvibrio japonicus 


3.2.1.4 


X86798 CAA60493.1 


Q59665 




(formerly Pseudomonas 








cellulosa) 








endo-1 ,4-glucanase A 


Ceilvibrio mixtus 


3.2.1.4 


AF003696AAB61 461.1 


007652 


endo-1 ,4-glucanase B 


Ceilvibrio mixtus 


3.2.1.4 


AF003697 AAB61462.2 


007653 


ORF manB (fragment) 


Chlorella vulgaris C-169 


n.d. 


AB036816 BAA99563.1 




Cellulase CelA 


Clavibacter 


3.2.1.4 


X62582 CAA44467.2 


Q9K5C7 




michiganensis NCPPB 










382 








cellulase CelA 


Clavibacter 


3.2.1.4 


AY007311 AAK1 6222.1 


Q9AF65 




michiganensis subsp. 










sepedonicus 








endo-1 ,4-glucanase C307 


Clostridium sp. F1 


3.2.1.4 


D00945 BAA00793.1 


P23340 


endo-1 f 4-glucanase 


Clostridium 


3.2.1.4 


M31311 AAA23230 1 


P 1570 4 




acGtobutviicijm 






ORF CAC0706 


Clostridium 


n.d. 


AE007585 AAK78683 1 


Q97L56 




acetobutylicum ATCC 










824 








ORF CAC0825 


Clostridium 


n.d. 


AE007598 AAK78801 1 


OQ7KIJ1 




acetobutylicum A TCC 










824 








ORF CAC0826 


Clostridium 


n.d. 


AE007598 AAK78802 1 






acetobutylicum ATCC 










824 








ORF CAC0912 


Clostridium 


n.d. 


AE007607 AAK78888 1 


V*<%7 f f\f\\J 




acetobutylicum ATCC 










824 








ORF CAC0918 


Clostridium 


n.d. 


AE007607 AAK7 8894.1 


Q97KK1 




acetobutylicum ATCC 










824 








ORF CAC3469 


Clostridium 


n.d. 


AE007844 AAK81 397.1 


Q97DK6 




acetobutylicum ATCC 










824 








ORF CAP0118 


Clostridium 


n.d. 


AE001438 AAK76863.1 


Q97TI3 



1CZ1 A 
1EQCA 
1EQPA 



i 
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ac tobutylicum ATCC 




NC_001988 NP_1 492811 








824 








ORF CAP0119 




Clostridium 
acetobutylicum ATCC 
824 


n.d. 


AE001438 AAK76864.1 
NCJ)01988 NPJ49282.1 


Q97TI2 


P-1,4-glucanase (Cel5l) 


CelSI 


Clostridium 


n.d. 


AY077754AAL79562.1 






cellulolyticum 








cefluiase CeIN 




Clostridium 
cellulolyticum 


3.2.1.4 


AF316823 AAG45162.1 


Q9EYQ0 


endo-1 ,4-glucanase A 




Clostridium 
cellulolyticum 


3.2.1.4 


M32362 AAA232211 
M93096AAA51 444.1 


P17901 


endo-1,4-glucanase D 




Clostridium 
cellulolyticum 


3.2.1.4 


D90341 BAA14354.1 


P25472 


mannanase ManK 




Clostridium 
cellulolyticum 


3.2.1.78 


AF3 16823 AAG45159 1 


Q9EYQ3 


endo-1 ,4-glucanase 5A 


Cel5A 


Clostridium 


3.2.1.4 


U37056 AAB40891.1 


P94622 


(EngF) 




cellulovorans 








endo-1 ,4-glucanase B 




Clostridium 
cellulovorans 


3.2.1.4 
3.2.1.8 
3.2.173 


M75706 AAA23231.1 


P28621 


endo-1, 4-glucanase D 




Clostridium 
cellulovorans 


3.2.14 


M37434 AAA23233.1 


P28623 


endo-1 ,4-glucanase E 




Clostridium 


3.2.1.4 


AF1 05331 AAD39739.1 


Q9XD99 


(EngE) 




cellulovorans 








mannanase A (ManA) 




Clostridium 
cellulovorans 


3.2.178 


AF1 32735 AAF06110.2 




endo-1 ,4-glucanase A 




Clostridium josui 


3.2.1.4 


D85526BAA12826.1 


Q59290 


endo-1 ,4-glucanase (CelA) 




Clostridium longisporum 


3.2.1.4 


L02868AAC37035.1 


P54937 


endo-1 ,4-glucanase E 


CelSC 


Clostridium 


3.2.14 


M22759 AAA23224.1 


P10477 


1 ,4-P-cellobiohydrolase 




thermocellum 










Clostridium 


3.2.191 


AJ275975 CAB76938.1 




(CelO) 




thermocellum F7 








endo-1 ,4-glucanase C 




Clostridium 


3.2.14 


AJ307315CAC27410.1 




(CeIC) 




thermocellum F7 






endo-1 ,4-glucanase B 


Cel5A 


Clostridium 


3.2.14 


AUOJOt Unruf ^QQ. I 


P04956 






thf*rmc\r frlli im NOiR 

If Iwl 1 1 l\J\*di*JI 1 1 f V \~/ 1 lD 










10682 








endo-1, 4-glucanase C 


Cel5B 


Cln ^trieiii im 

\^ IU/UI II 

thermocellum NCIB 
10682 


1 / 

0.*£. I .*+ 




P07985 


endo-1 ,4-glucanase G 


Cel5D 


Clostridium 


3.2.14 


X69390 CAA49187.1 


Q05332 






thermocellum NCIB 










10682 








endo-1 ,4-glucanase H 


Cel26A- 


Clostridium 


3.2.1.4 


M31903 AAA23225.1 


P16218 


(CelH) 


-Cel5E 


thermocellum NCIB 
10682 






exo-1,3-P-glucanase 




Cochliobolus carbonum 


3.2.158 


AF229446 AAF65310.1 




endo-1 ,4-glucanase 1 




Cryptococcus flavus 


3.2.1.4 


S45137 AAC605411 


Q04469 


exo-P-1 ,3-glucanase 








D13967 BAA03070.1 






Cryptococcus 


n.d. 


AF243533 AAM 26269.1 




(fragment) 




neoformans var. 










neoformans 








endo-P-mannanase 




Datura ferox 


n.d. 


AF323949 AAO06964.1 




(fragment) 












exo-1 ,3-glucanase 




Debaryomyces 


3.2.1.58 


Z46871 CAA869511 


Q12700 






occidentalis 










(S. occidentalis) 








endo-1 ,4-glucanase A 




Emericella nidulans 


3.2.14 


AB009402 BAA82592.1 




(eglA) 










cellulase CelA 




Epidinium caudatum 


n.d. 


AB104618BAC57895.1 





1CEN 
1CEO 
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endo-1,4-glucanase 


Epidinium caudatum 


endoglucanase Epi2 


Epidinium caudatum 


endo-1 ,4-glucanase N 


Erwinia carotovora 




atroseptica 


endo-1 ,4-glucanase V1 


Erwinia carotovora 




SCC3193 


endo-1,4-glucanase V 


Erwinia carotovora 




SCRI193 


endo-1 ,4-glucanase CelA 


Erwinia carotovora 


(fragment) 


subsp. carotovora LY34 


xylanase 


Erwinia chrysanthemi 


endo-1 ,4-glucanase Z 


Cel5A Erwinia chrysanthemi 




3937 


endo-1 ,4-glucanase 3 


Fibrobacter intestinalis 


endo-1 ,4-glucanase 3 


Fibrobacter 




succinogenes 


endo-1, 4-glucanase G 


Fibrobacter 




succinogenes 


cellodextrinase A (CedA) 


Fibrobacter 




succinogenes S85 


CMC-xylanase (fragment) 


Fibrobacter 




succinogenes S85 


cellulase (Celc) 


Filobasidiella 




neoformans 309 Cap67 


P-mannanase 


Geobacillus 




stearothermophiius 




MCA2184 


endoglucanase 1 (Engl) 


Giobodera rostochiensis 


endoglucanase 2 (Eng2) 


Globodera rostochiensis 


P-1 ,4-endoglucanase 


Giobodera rostochiensis 


precursor (Eng-3) 


Ro-1 Mierenbos 


R A A — | 

h-1 ,4-endoglucanase 


Giobodera rostochiensis 


precursor (Eng-4) 


Ro-1 Mierenbos 


endoglucanase 1 (Engl) 


Heterodera giycines 


endoglucanase 2 (Eng2) 


Heterodera glycines 


endoglucanase 3 


Heterodera glycines 


nuuyiuudndSc *t 


Heterodera glycines 


endo-1 ,4-glucanase 


Humicola grisea 


endo-1 ,4-glucanase IV 


Humicola insolens 


P-1,6-glucanase precursor 


Hypocrea virens 


(Bgn3) 




exo-1 ,3-glucanase 


Kluyveromyces lactis 


mannanase (Lm1) 


Lactuca sativa 


Xyl38 


Leptosphaeria maculans 


LB251 


Leptospira interrogans 




serovar lai sir. 56601 


P-1,4-mannanase (Man2) 


Lycoperslcon 


esculentum 


P-mannanase 


Lycopersicon 




esculentum 


endo-P-mannanase 


Lycopersicon 




esculentum 
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3.2.1.4 


AB011273 BAA76394.1 






n.o. 


AblU4Ol0 dAC37o?3.1 






3.2.1.4 


L39788 AAC37033.1 


Q59394 




3.2.1.4 


X79241 CAA55823.1 


Q59395 




O.Z.I A 


A/ouuu u/VAOooy^.i 


U47Q96 




3.2.1.4 


AF025768 AAC02964.1 


031029 




3.2.1.8 


U41750 AAB53151.1 


Q46961 


1NOF 


o._M.4 


YUU04U WaA0oDU4.1 


P07103 


1EbZ A 


0.-..1 .4 




Q59440 




3.2.1.4 


M29047AAA24893.1 


P14250 




3.2.1.4 


U33887 AAB38548.1 


Q47916 




3.2.1. - 


U07419AAA50210.1 


Q59441 




3.2.1.4 
3.2.1.8 


U94826 AAC06197.1 


066065 




n.d. 


AJ486863 CAD31 1101 

* »w % www Wfi V v III \i • 1 






3.2.1.78 


AF038547 AAC71692 1 

* " w w^ i #^W^%%^ 1 1 Vvh* 1 






3.2.1.4 


AF004523 AAC48325.1 
AF056110AAC63988.1 


016028 




3.2.1.4 


AF004716AAC48341.1 
AF056111 AAC63989.1 


044078 




3.2.1.4 


AF408154AAN03645.1 
AF408155 AAN03646.1 






3.2.1.4 


AF408156 AAN03647.1 
AF408157 AAN03648.1 






3.2.1.4 


AF052733 AAC 15707.1 
AF006052 AAC48327!l 
A79341 CAB59144.1 


018453 




3.2.1.4 


AF006053 AAC48326.1 
A79340 CAB59143.1 


018454 




3.2.1.4 


AF044210AAC33848.1 
AF056048 AAC33860.1 


077449 




3.2.1.4 


AY043224 AAK85303.1 






3.2.1.4 


D84470 BAA1 2676.1 


Q12620 




3.2.1.4 


X76046 CAA53631.1 


Q12624 




n.d. 


AF395757AAL84696.1 







3.2.1.58 


Z46869 CAA86949.1 


Q12628 


n.d. 


AJ315978 CAC51690.1 
AJ315978CAC51690.2 


Q93X40 


n.d. 


AF487263 AA049459.1 




n.d. 


AE011612AAN51810.1 
NC_004343 NP_71 4795.1 




3.2.1.78 


AF1 84238 AAG00315.1 




3.2.1.78 


AF017144 AAB87859.1 


048540 


3.2.1.78 


AY034075 AAK56557.1 
AY046589 AAK97759.2 





0 
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endo-1 ,4-glucanase 1 


Macrophomina 


3.2.1.4 




phaseolina 




I lUvi I ,*t~y lUUdildoc* £. 


Macropnomtna 




P-1 ,4-endoglucanase Eng- 


ohaseolina 




Meloidogyne arenaria 


3.2.1.4 


1i (fragment) 








Meloidogyn& arenaria 


3.2.1.4 


1ii (fragment) 






|JL1 ^>AnHnnli iransco Pnn. 
r™ i ,*t"Ci luuyiUOdJ Idoc d iy- 


Meloidogyne hapla 


3.2.1.4 


1i (fragment) 






r^* i (t-tsi juuyiuuaiidotJ cng- 

1a 


Meloidoovne incoanita 


3.2.1.4 




Meloidoovne incoanita 


3.2.1.4 


1i (fragment) 






LJL A 4.onrtrtnli ironoco Chn 

r* - i .^t-oiiuuyiuudndoo cng- 


Meloidoovne innnnnita 


3.2.1.4 


1ii (fragment) 






r*~ i ,h*oi luuyiuLrdi idot/ cny- 


Meloidoovne incoanita 


3.2.1.4 


1 iii (fragment) 






~ I ,*t-tsi luuy lUbdlldoc r_iiy- 


Meloidoovne incoanita 


3.2.1.4 


1 i v (fragment) 






P-1,4-endoglucanase Eng- 


Meloidogyne incognita 


3.2.1.4 


1v (fragment) 






r- 1 ,*t-enuoy lucanase cng- 


Meloidoovne innnanita 




1vi (fraoment) 






P-1,4-endog lucanase Eng-2 


Meloidogyne incognita 


3.2.1.4 


(fragment) 






P-1 ,4-endoglucanase Eng- 


Meloidogyne incognita 


3.2.1.4 


P-1 f 4-endoglucanase Eng- 


Meloidogyne incognita 


3.2.1.4 


£.\J ^lidyillcruj 






endo-1 ,4-glucanase 1 (Eng- 


Meloidogyne incognita 


3.2.1.4 


xyianase (Ayi-i ) 


Meloidogyne incognita 


3.2.1.8 


P-1 ,4-endoglucanase Eng- 


Meloidogyne javanica 


3.2.1.4 


1i (fragment) 






P-1 ,4-endoglucanase Eng- 


Meloidogyne javanica 


3.2.1.4 


1ii (fragment) 






Mlr2086 


Mesorhizobium loti 


n.d. 




MAFF303099 




Mlr3893 


Mesorhizobium loti 


n.d. 




MAFF303099 




endo-1 ,4-mannanase 


Mytiius edulis 


3.2.1.78 


endo-1 ,4-glucanase A 


Neocallimastix frontalis 


3.2.1.4 


endo-1 ,4-glucanase B 


Neocallimastix 


3.2.1.4 


(CelB) 


patriciarum 




endo-1 ,4-glucanase D 


Neocallimastix 


3.2.1.4 


(CelD) 


patriciarum 




P-1,6-glucanase 


Neotyphodium sp. 


3.2.1.75 






ORF B23L21.220 


Neurospora crassa 


n.d. 


ORF Alr0290 


Nostoc sp. PCC 7720 


n.d. 


endo-1 ,4-glucanase 


Orpinomyces joyonii 


3.2.1.4 


endo-1 ,4-glucanase CelB 


Orpinomyces joyonii 


3.2.1.4 


endo-1 ,4-glucanase CelB29 


Orpinomyces joyonii 


3.2.1.4 



AY046588 AAK97760.1 
U14948 AAB51451.1 



Q12638 
Q12637 



U13914AAB03889.1 

AF323097AAK21892.1 

AF323098 AAK21 893.1 

AF323096 AAK21 891.1 

AF323087 AAK21882.1 Q9BJZ1 
AF323090.1 AAK21885.1 

AF323091 AAK21886.1 

AF323092 AAK21 887.1 

AF323093 AAK21888.1 

AF323094 AAK21 889.1 

AF323095 AAK21890.1 

AF323086AAK21881.1 

AF323088AAK21 883.2 
AF323089.1 AAK21 884.1 

AF1 00549 AAD45868.1 Q9UA57 

AF224342 AAF37276.1 
AF323099 AAK21894.1 

AF323100 AAK21 895.1 

NC_002678 NP_1 03519.1 

NC_002678NP 104903.1 



AJ271365CAC81056.1 
U38843AAC63094.1 
X77186 AAE59927.1 
Z31364 CAA83238.1 
CAA83238.1 

AF053363 AAC06321.1 

AF535131 AAN04103.1 

AL356172 CAB91 690.1 
AP003581 BAB77814.1 
NC_003272 NP_484334.1 
U59432AAC49731.1 
AF015249AAB69348.1 
AF015248 AAB69347.1 



Q01409 
Q12647 

059943 



P87211 
013334 
013333 



cryst 



7 
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endo-l,4-glucanase B 


Orpinomyces sp. PC-2 


3.2.1.4 


U57818AAD04193.1 








AAE59925.1 


mannanase ManA 


Orpinomyces sp. PC-2 


n.d. 


AF1 77206 AAL01213.1 


OJ1208D02.11 


Oryza sativa 


n.d. 


AC1 07314 AAM08620.1 




(japonica cultivar-group) 






OJ1208D02.5 or 


Oryza sativa 


n.d. 


AC107314 AAM08614.1 


OJ1003C07.1 


(japonica cultivar-group) 




AC1 13335 AAM08821.1 


OSJNBa0010E04.2 


Oryza sativa 


n.d. 


AC096687 AAL79758.1 




(japonica cultivar-group) 






OSJNBa0010E04.20 


Oryza sativa 


n.d. 


AC096687 AAL79761.1 




(japonica cultivar-group) 






OSJNBb0011N17.6 


Oryza sativa 


n.d. 


AL606614CAD41089.1 




(japonica cultivar-group) 






OSJNBb0011N17.7 


Oryza sativa 


n.d. 


AL606614CAD41090.1 




(japonica cultivar-group) 






OSJNBb0011N17.8 


Oryza sativa 


n.d. 


AL606614CAD41091.1 




(japonica cultivar-group) 






P0481E12.1 


Oryza sativa 


n.d. 


AP003076 BAB56016.1 


(OSJNBa0014K08.29) 


(japonica cultivar-group) 




AP003376 BAC05600.1 


P0671D01.24 


Oryza sativa 


n.d. 


AP003284 BAB91747.1 


endo-P-1 ,4-glucanase 


(japonica cultivar-group) 






Paenibacillus sp. 


n.d. 


AF345984 AAL83749.1 


KCTC8848P 






43 KDa glycoprotein 


Paracoccidioides 


n.d. 


U26160 AAC49253.1 




brasiliensis B339 






43 kDa glycoprotein (Gp43) 


Paracoccidioides 


n.d. 


AB047690 BAB1 2190.1 


(fragment) 


brasiliensis IFM41620 






(=Pb-9) 






43 kDa glycoprotein (Gp43) 


Paracoccidioides 


n.d. 


AB047691 BAB12191.1 


(fragment) 


brasiliensis IFM41621 






(=Pb-18) 






43 kDa glycoprotein (Gp43) 


Paracoccidioides 


n.d. 


AB047693 BAB12193.1 


(fragment) 


brasiliensis IFM41624 








(=Bt-4) 






43 kDa glycoprotein (Gp43) 


Paracoccidioides 


n.d. 


AB047694 BAB1 2194.1 


(fragment) 


brasiliensis IFM41626 






(=Bt-9) 






43 kDa glycoprotein (Gp43) 


Paracoccidioides 


n.d. 


AB047695 BAB12195.1 


(fragment) 


brasiliensis IFM41632 




43 kDa glycoprotein (Gp43) 


Paracoccidioides 


n.d. 


AB047696 BAB12196 1 


(fragment) 


brasiliensis IFM46215 




43 kDa glycoprotein (Gp43) 


Paracoccidioides 


n.d. 


AB047700 BAB12200.1 


(fragment) 


brasiliensis IFM46463 






(=Tatu) 






43 kDa glycoprotein (Gp43) 


Paracoccidioides 


n.d. 


AB047692 BAB12192.1 


^ii ciy 1 1 ic/i iif 


brasiliensis IFM46465 






(=Pb-267) 






43 kDa glycoprotein (Gp43) 


Paracoccidioides 


n.d. 


AB047699 BAB12199.1 


(fragment) 


brasiliensis IFM46467 




43 kDa glycoprotein (Gp43) 


Paracoccidioides 


n.d. 


AB047698 BAB12198 1 


(fragment) 


brasiliensis IFM46468(P- 
25) 




43 kDa glycoprotein (Gp43) 


Paracoccidioides 


n.d. 


AB047697 BAR121Q7 1 


(fragment) 


brasiliensis IFM46930 




43 kDa glycoprotein (Gp43) 


Paracoccidioides 


n.d. 


AB047701 BAB1 2201.1 


(fragment) 


brasiliensis IFM47183 






(=PRT1-Tatu 1) 






43 kDa glycoprotein (Gp43) 


Paracoccidioides 


n.d. 


AB047702 BAB12202.1 


(fragment) 


brasiliensis IFM47185 






(=PRT2 Y) 






43 kDa glycoprotein (Gp43) 


Paracoccidioides 


n.d. 


AB047704 BAB 12204.1 
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(fragment) 


brasiliensis IFM47217 




(=D4S1=Tatu 4) 


43 kDa glycoprotein (Gp43) 


Paracoccidioides 


(fragm nt) 


brasiliensis IFM47217 




(=D4S9) 


43 kDa glycoprotein <Gp43) 


Paracoccidioides 


(fragment) 


brasiliensis IFM47247 




(=D4LIV1) 


endo-xylanase (XynA) 


Pectobacterium 




II yoaiiu iZ/ii it ruUv^ (i7 


P»ndo-1 4-nlurc)nn<%p II 

V*l IUU 1 | » H'UvvilluOC II 


~ciii{si!iiuiii janinirwiiurn 


ORF Exo1 


Phvtnnhfhnrfi in fasten s 

I 1 lyiKJfJI III IKJI a frf/C7O10//O 


ORF Exo2 


Phx/tnnhthnm infadan e 
iiiyi\jfJiiiiiLfia if II Cola f fo 


ORF Exrrt 


"nyiopninora inTesians 


exo-1 ,3-glucanase 


Pichia angusta 


exo-1 ,3-glucanase 


Pichia anomala K 


endoglucanase 5A 


Cel5A Piromyces equi 


endoglucanase (egl-1) 


Piromyces rhizinflata 


cellulase CelA (fragment) 


Polyplastron 




multivesiculatum 


endo-1,4-glucanase 


Prevotelia ruminicofa 


endo-1 ,4-glucanase 


Prevotella ruminicoia 


(OKF4) 




xylanase 


Prevotelia ruminicoia 


cellulase (PhEG) 


Psacothea hilaris 


CeIG 


Pseudoalteromonas 




haloplanktis A23 


PP1682 


Pseudomonas putida 


P-1,4-mannanase (AmnA) 


KT2440 


Pseudomonas sp. 


ND137 


cellulase (AcIA) 


Pseudomonas sp. 




ND137 


PSPTO0905 


Pseudomonas syringae 




pv. tomato str. DC3000 


PSPT03292 


Pseudomonas syringae 




pv. tomato str. DC3000 


ORF PAB0632 


Pyrococcus abyssi 


endoglucanase (PH1171) 


Pyrococcus horikoshii 




OT3 


ndo-1 ,4-glucanase 


Ralstonia solanacearum 




AW 


endoglucanase (fragment) 


Ralstonia solanacearum 




CFBP2047 


endoglucanase (fragment) 


Ralstonia solanacearum 




CFBP2957 


endoglucanase (fragment) 


Ralstonia solanacearum 




CFBP2958 


endoglucanase (fragment) 


Ralstonia solanacearum 




CFBP2972 


endoglucanase 


Ralstonia solanacearum 




CFBP3059 


endoglucanase (fragment) 


Ralstonia solanacearum 




CFBP3858 


ndoglucanase (fragment) 


Ralstonia solanacearum 




CFBP712 


ndoglucanase (fragment) 


Ralstonia solanacearum 




CFBP715 


endoglucanase 


Ralstonia solanacearum 
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n.d. 


AB047703 BAB1 2203.1 




n.d. 


AB047705 BAB1 2205.1 




n.o. 


MY UOO MU AAL1 641 5.1 




3.2.1.4 


X89564 CAA61 740.1 




n.d. 


AF494014 AAM1 8483.1 




n.d. 


AF494016AAM18485.1 




n H 

n.u. 


Ar4y4U1 0 AAM1 84o4.1 




3.2.1.58 


246868 CAA86948.1 


Q12626 


3.2.1.58 


AJ222862 CAA1 1018.1 


093983 


3.2.1.4 


AJ277483 CAB92326.1 




3.2.1.4 


AF1 65266 AAD43818.1 


Q9Y870 


n.d. 


AB104619 BAC57896.1 




3.2.1.4 


AB022867 BAA74515.1 


Q9ZN63 


3.2.1.4 


U96771 AAC97596.1 


006842 


3.2.1.8 


M83379 AAC36862.1 


Q45397 


n H 


ARftAftOR£ DADOCQC7 4 

MDUOU/iOO dAdodod7.1 




3.2.1.4 


Y17552 CAA76775.1 


086099 


n.d. 


AE016780AAN67303.1 
NC_002947 NP_743839.1 




n.d. 


AB063258 BAB79290.1 
AB063258 BAB79290.2 




ad. 


AB063256 BAB79288.1 




n.d. 


AE016859 AA054439.1 
InU_UU40/o lNr_/yuY44.1 




n.d. 


AE016867 AAO56770.1 

Mf f\(\AtZ7Q MD TQQA7C A 




n.d. 


AJ248285 CAB49854.1 

Mr* nnriftftft md -ioraoo a 
InO_UUUOOO IN ^00^0.1 


Q9V052 


3.2.1.4 


AP000005BAA30271.1 


058925 


3.2.1.4 


M84922 AAA61 980.1 


P17974 


n.d. 


AF295262 AAK97112.1 




n.d. 


AF295265 AAK97115.1 




n.d. 


AF295266 AAK97116.1 




n.d. 


AF295264 AAK97114.1 




n.d. 


AF295270 AAK97120.1 




n.d. 


AF295259AAK97109.1 




n.d. 


AF295267AAK97117.1 




n.d. 


AF295268 AAK97118.1 




n.d. 


AF295274 AAK97124.1 
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endoglucanase (fragment) 
ORF Egl 

endoglucanase (fragment) 

endoglucanase 

endoglucanase (fragment) 

endoglucanase (fragment) 

endoglucanase (fragment) 

endoglucanase 

endoglucanase 

endoglucanase (fragment) 

endoglucanase 

endoglucanase 

ndoglucanase (fragment) 

endoglucanase 

endoglucanase (fragment) 

endoglucanase 

endoglucanase (fragment) 

endoglucanase (fragment) 

endoglucanase (fragment) 

endoglucanase (fragment) 

endoglucanase (fragment) 

endoglucanase (fragment) 

endoglucanase (fragment) 

endoglycoceramidase 
endoglycoceramidase II 
endo-1,4-glucanase I 

ndo-1,4-glucanase4 

endo-1 ,4-glucanase A 

endo-1 ,4-glucanase B 
endo-1 ,4-glucanase I 
xyianase (fragment) 

P-1,4-endoglucanase VII 



CFBP734 

Ralstonia solanacearum n.d. 
GMI1000 

Ralstonia solanacearum 3.2. 1 .4 
GMI1OO0 

Ralstonia solanacearum n.d. 
ICMP7963 

Ralstonia solanacearum n.d. 
J25 

Ralstonia solanacearum n.d. 
J25 

Ralstonia solanacearum n.d. 
JT516 

Ralstonia solanacearum n.d. 
JT523 

Ralstonia solanacearum n.d. 
JT525 

Ralstonia solanacearum n.d. 
JT52S 

Ralstonia solanacearum n.d. 
MAFF211266 

Ralstonia solanacearum n.d. 
NCPPB1018 

Ralstonia solanacearum n.d. 
NCPPB283 

Ralstonia solanacearum n.d. 
NCPPB3190 

Ralstonia solanacearum n.d. 
NCPPB342 

Ralstonia solanacearum n.d. 
NCPPB3987 

Ralstonia solanacearum n.d. 
NCPPB505 

Ralstonia solanacearum n.d. 
R230 

Ralstonia solanacearum n.d. 
R292 

Ralstonia solanacearum n.d. 
UW151 

Ralstonia solanacearum n.d. 
UW162 

Ralstonia solanacearum n.d. 
UW469 

Ralstonia solanacearum n.d. 
UW477 

Ralstonia solanacearum n.d. 
UW9 

Rhodococcus sp. C9 3.2.1.123 

Rhodococcus sp. M-777 3.2.1.123 

Robiilarda sp. Y-20 3.2.1 .4 
(Pestalotiopsis sp.) 

Ruminococcus albus 3.2.1 .4 



Ruminococcus albus 3.2.1 .4 



Ruminococcus albus 3.2. 1 .4 
Ruminococcus albus 3 .2. 1 .4 
Ruminococcus albus n.d. 



Ruminococcus albus F- 3.2. 1 .4 



AF295251 AAK97101.1 




AL646076 CAD17313.1 




NC 003296 NP_521 723.1 




AF295263 AAK97113.1 




AF295279 AAK97129.1 




AF295276 AAK97126.1 




AF295258 AAK97108.1 




AF295252AAK97102.1 




AF295272 AAK97122.1 




AF295273 AAK97123.1 




AF295250AAK97100.1 




AF295271 AAK97121.1 




AF295275AAK97125.1 




AF295253 AAK97103.1 




AF295278 AAK97128.1 




AF295261 AAK97111.1 




AF295277 AAK97127.1 




AF295280 AAK97130.1 




AF295255 AAK97105.1 




AF295254 AAK97104.1 




AF295256AAK97106.1 




AF295269 AAK97119.1 




AF295260 AAK971 10.1 




AF295257 AAK97107.1 




AB042327 BAB17317.1 




U39554 AAB67050.1 


033853 


AB030819 BAA90480.1 


P23044 


AB016777 BAA32286.1 


Q07940 


D16315BAA32286.1 




L10243 AAA26467.1 


P23660 


X54931 CAA38692.1 


Q59733 


X54932 CAA38693.1 


P23661 


M30928 AAA26469.1 


P16216 


AB057589 BAB39494.1 




AB057589 BAB39495.1 




AB028321 BAA92430.1 
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(EgVII) 


40 








endo-1 ,4-glucanase V 


Ruminococcus albus F- 


3.2.1.4 


AB028320 BAA92146.1 




(EgV) 


40 








endo-1 ,4-glucanase A 


Ruminococcus 


3.2.1.4 


S55178 AAB19708.1 


005143 


(EndA) 


flavefaciens 17 




Z83304CAB05881.1 


Q53302 


celiodextrinase 


Ruminococcus 
flavefaciens FD-1 


3.2.1. - 


X51944 CAA36207.1 


P16169 


EXG1 


Saccharomyces bayanus 
623-6C 


n.d. 


AY144818AA032382.1 




SPR1 


Saccharomyces bayanus 
623-6C 


n.d. 


AY144819AA032383.1 




EXG1 (fragment) 


Saccharomyces castellii 
CBS4309 


n.d. 


AY144925AA032489.1 




EXG1 (fragment) 


Saccharomyces castellii 
CBS4309 


n.d. 


AY144926 AAO32490.1 




exo-1 ,3-glucanase 


Saccharomyces 


3.2.1.58 


S52935AAB24895.1 


P32603 


(sporulation) 


cerevisiae GRF88 




X59259 CAA41 952.1 
Z75098 CAA99399.1 




exo-1 ,3-glucanase 1 


nrhfi mmvt*p <; 
cerevisiae S288C 




U17243 AAB67345.1 
NC_001144 NP_013403.1 




exo-1 ,3-glucanase 2 


Saccha romyces 
cerevisiae S288C 


3.2.1.58 


Z68329CAA92719.1 
Z70202 CAA94100.1 
NCJ)01136 NP_01 0547.1 


rD£&i 1 


ORF YBR056W 


Saccharomyces 
cerevisiae S288C 


n.d. 


Z35925 CAA84999.1 
Z46260 CAA86399 1 
NC_001134 NP_009612.1 


i wOUO 1 


ORF YIB7W 


Saccharomyces 
cerevisiae S288C 


n.d. 


Z37996 CAA86077 1 
Z38062 CAA86209.1 
NC_001141 NP_01 2272.1 




EXG1 


Saccharomyces kluyveri 


n.d. 


AY145000 AA032563.1 






CBS3082 






ndo-1 ,4-glucanase I 


Schizophyllum 


3.2.1.4 




P81190 


(fragment) 


commune 






exo-1 ,3-glucanase 


Schizosaccharomyces 


3.2.1.58 


Z70721 CAA94701.1 


Q10444 


(SPAC12B10.11) 


pombe 972h- 




NC_003424 NP_594643.1 


ORF SPBC1 105.05 


Schizosaccharomyces 
pombe 972h- 


n.d. 


AL096851 CAB50968.1 
NC_003423 NP_596461.1 




ORF SPBC2D 10.05 


Schizosaccharomyces 
pombe 972h- 


n.d. 


NCJ)03423.1 NP_596224.1 




endoglucanase S (EndS) 


Sinorhizobium meiiloti 


3.2.1.4 


AF233448 AAG44364.1 




ceilulase 


Stigmatella aurantiaca 
Sga15 


n.d. 


AJ421825 CAD1 9084.1 




ORF ManA2 (SC2H4.16) 


Streptomyces coelicolor 
A3(2) 


n.d. 


AL031514CAA20610.1 
NC_003888 NP_630335.1 


086599 


ORF SC10F4.10C 


Streptomyces coelicolor 
A3(2) 


n.d. 


AL450350 CAC1 6970.1 
NC_003888 NP_631 677.1 


Q9F3Q0 


ORF SC6D1 1.02c 


Streptomyces coelicolor 
A3(2) 


n.d. 


AL1 58061 CAB76325.1 
NC__003888 NP_631455.1 




ORF SCF1 1.34c (ManA) 


Streptomyces coelicolor 
A3(2) 


n.d. 


AL121746 CAB57406.1 
AL132662CAB59611.1 
AL939106 CAD55266.1 




ORF SCF51A.35 


Streptomyces coelicolor 
A3(2) 


n.d. 


AL121596 CAB56687.1 
NC_003888 NP_624777.1 




ORF SCM1 1.04c 


Streptomyces coelicolor 
A3(2) 


n.d. 


AL133278 CAB61915.1 
NCJ303888 NP_625246.1 




chitosanase II (Choll) 


Streptomyces gnseus 3.2.1 .1 32 


AB088201 BAC65342.1 






HUT 6037 






endo-1 ,4-glucanase 


Streptomyces lividans 


3.2.1.4 


M82807 - 


P27035 


P-mannanase 


Streptomyces lividans 


3.2.1.78 


M92297AAA26710.1 


P51529 
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66 



ORF SSO3007 


Sulfolobus solfataricus 
P2 


n.d. 


AE006891 AAK43109.1 
AL512980 CAC24030.1 




pnHn W-1 A nliiranaco /Frt^ 
ci iuu ~ i ,t yiu^/di loot; ^E-*y i ) 






NC_002754NP_344319.1 




Tataromyces emersonii 


3.2.1.4 


AF440003 AAL33630 1 








AF440013 AAL33639.1 




ORF TTE0061 


Thermoanaerobacter 
tengcongensis MB4T 


n.d. 


AE012980 AAM23368.1 


Q8RDH1 


ORF TTE0359 


Thermoanaerobacter 


n.d. 


AE013009 AAM23649.1 


Q8RCQ7 


P-mannanase A 


tengcongensis MB4T 








Thermoanaerobacterium 
polysaccharolyticum KM- 


3.2.1.78 


U82255 AAD09354.1 


Q9ZA17 


P-1 ,4-glucanase (ENG1 ) 


THCJ 








Thermoascus 


3.2.1.4 


AY055121 AAL16412.1 




aurantiacus IFO 9748 




AF487830 AAL88714.2 




endo-1 ,4-glucanase E5 


Cel5A Thermobifida fusca 


3.2.1.4 


L01577AAC09379.1 


Q01786 


P-mannanase (Man) 


Thermobiftda fusca 


3.2.1.78 


AJ006227 CAA06924.1 




(Thermomonospora 
fusca) 








endo-1 ,4-glucanase 


Thermophilic anaerobe 


3.2.1.4 


AB008029 BAA22939.1 


024820 



1GZJ A 
1H1N A 

1BQCA 

2MAN A 
3MAN A 



ORF TVG0696003 

P-mannanase 
(ManB;TM1227) 



Man5 



endoglucanase (TM1751) CelSA 



ORF TM 1752 

P-mannosidase 
cellulase 

endo-1 ,6-glucanase 



CelSB 



NA10 

Thermoplasma n.d. 
volcanium GSS1 

Thermotoga maritima 3.2.1 .78 



Thermotoga maritima 3.2.1 .4 

Thermotoga maritima n.d. 

Thermotoga neapoiitana 3.2.1.25 

Thermus caldophilus 3.2. 1 .4 

Trichoderma harzianum 3.2. 1 .75 



AP000993 
NC_002689 

AE001779 
Y17980 
NCJ)00853 

AE001813 
NC_000853 
AE001813 
NC_000853 
AY033477 
Y17981 
AF289822 
X79197 



BAB59829.1 

NP_1 11209.1 

AAD36302.1 

CAB56854.1 

NP_229032.1 

AAD36816.1 

NP_229549.1 

AAD36817.1 

NP_229550.1 

AAK53459.1 

CAB56856.1 

AAK60011.1 

CAA55789.1 



P-i 



mannanase 



Trichoderma reesei 
(Hypocrea jecorina) 



3.2.1.78 



L25310AAA34208.1 



ndo-1 ,4-glucanase II 


Cel5A Trichoderma reesei 


3.2.1.4 


M19373 AAA34213.1 


P07982 




(Hypocrea jecorina) 






endo-1 ,4-glucanase II 


Trichoderma viride 


3.2.1.4 


AB021657 BAA36216.1 


093833 


endo-1 ,4-glucanase (Cel) 


Unidentified bacterium 


3.2.1.4 


U12011 AAA91966.1 


Q60054 


Sequence 1 from patent US 


Unknown 


n.d. 


AAE60112.1 




6190899 








Sequence 2 from patent US 


Unknown. 


n.d. 


AAN27517.1 




6440911 








Sequence 4 from patent US 


Unknown. 


n.d. 


AAN27518.1 




6440911 








P-mannanase 


Vibrio sp. MA-138 


3.2.1.78 


D86329BAA25188.1 


069347 


endoglucanase 


Votvariella volvacea V14 


3.2.1.4 


AF329732 AAG59832.1 




endo-1, 4-glucanase (ENG) 


Xanthomonas 


3.2.1.4 


L26543 AAA27599.1 


Q60100 




albilineans 






ORF Egl 


Xanthomonas 
axonopodis pv. citri str. 
306 


n.d. 


AE011625 AAM34920.1 
NCJ)03919 NP_640384.1 




ORF Egl 


Xanthomonas 
axonopodis pv. citri str. 
306 


n.d. 


AE011626 AAM34921.1 
NCJ)03919 NP_640385.1 





Q97AX2 
Q9X0V4 

Q9X273 
Q9X274 



Q12711 
Q12712 

Q99036 1QNOA 
1QNPA 
1QNQA 
1QNRA 
1QNSA 
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ORF Egl 


Xanthomonas 




axonopodis pv. citristr. 




306 


ORF EngXCA 


Xanthomonas 




axonopodis pv. citri str. 




306 


ORF XAC0346 


Xanthomonas 




axonopodis pv. citristr. 




306 


ORF XAC0933 (fragment) 


Xanthomonas 




axonopodis pv. citri str. 




306 


ORF XAC0934 (fragment) 


Xanthomonas 




axonopodis pv. citri str. 




306 


ORF XAC1796 


Xanthomonas 




axonopodis pv. citri str. 




306 


endo-1 ,4-glucanase 


Xanthomonas 




campestris 


EngXCA 


s\CJI 1 11 \\Jt I IKJI fCIO 




camp&stris pv. 




campestris str. ATCC 




33913 


ORF Egl 


XftnthnfYtnnziQ 

S\QI HI IVJI 1 )\JI Iuj 




campestris pv. 




campestris str. ATCC 




33913 


ORF Egl 


s\ai hi t\ji i ivji too 




camDRstri<i nv 




campestris str. ATCC 




33913 


ORF Egl 


s\ai hi Hjinui icio 




caiTiDestris 01/ 




campestris str. A TCC 




33913 


ORF XCC0857 


S\GI HI l\st 1 HJIIaO 




camoBstris nv 




campestris str. ATCC 




33913 


ORFXCC1778 


y\ai HI tsJI i ikjihao 




camna^trift ni/ 

wU f / lf-J\2 O If 1 0 fJw. 




campestris str. ATCC 




33913 


ORF XCC3535 


Xanthomonas 




campestris pv. 




campestris str. ATCC 




33913 


ORFXF0810 


Xyieila fastidiosa 9a5c 


ORF XF0818 


Xyte/la fastidiosa 9a5c 


ORF XF2708 


Xyfeiia fastidiosa 9a5c 


Egi 


Xytella fastidiosa 




Temeculal 


EngXCA 


Xyielia fastidiosa 




Temecuial 


EngXCA 


Xyielia fastidiosa 




Temecuial 


exo-1,3-glucanase 


Yarrowia lipolytics 
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n.d. AE01 1 626 AAM34922.1 
NC_003919NP 640386.1 



n.d. AE01 1689 AAM35501.1 
NC_003919NP_640965.1 

n.d. AE01 1 660 AAM35238.1 
NC_003919 NP_640702.1 

n.d. AE01 1 724 AAM35821 . 1 
NC„003919NP_641285.1 

n.d. AE01 1 724 AAM35822.1 
NC_003919NP_641286.1 

n.d. AE011812AAM36659.1 
NC_003919 NP_642 123.1 

3.2.1.4 M32700AAA27612.1 P19487 

AJ30441 5 CAC1 8529.1 
n.d. AE012472AAM42791.1 
NC_003902NP 638867.1 



n.d. AE012095 AAM39345.1 
NC_003902 NP_635421.1 



n.d. AE012096 AAM39346.1 
NC_003902 NP_635422.1 



n.d. AE012096AAM39347.1 
NC_003902 NP_635423.1 



n.d. AE012185 AAM40172.1 
NCJ)03902 NP_636248.1 



n.d. AE012279AAM41068.1 
NC_003902 NP_637144.1 



n.d. AE012474 AAM42805.1 
NC_003902NP 638881.1 



n.d. AE003920AAF83620.1 Q9PF68 
NC_002488 NP_298 100.1 

n.d. AE003921 AAF83628.1 Q9PF60 

NC_002488 NP_298108.1 

n.d. AE004077AAF85505.1 Q9PA12 

NC_002488 NP_299985.1 
n.d. AE012561 AA029885.1 

NC_004556 NP.780236.1 
n.d. AE012560 AA029683.1 

NC_004556 NP_780034.1 
n.d. AE012560 AA029688.1 

NC.004556 NP_780039.1 
3.2.1.58 Z46872CAA86952.1 Q12725 



http://afmb.cnrs-mrs.fr/CAZY/GH_5.html 



4/10/2003 



Page 14 of 14 



Last updated on 2003 Apr 1 
© Copyright 1998-2003 AFMB-CNRS 

Please send your comments to Bernard Henrissat, Pedro Coutinho or Emeline 

Deleury 



http://afmb.cnrs-mrs.fr/CAZY/GH_5.html 



o 

4/10/2003 



CAZy - CBM 



Page 1 of 2 



CAZy - Carbohydrate-Active 

enZYmes 

Access bv Access bv 
Home Family Organism ActaowMgemeirt. Ut]a Team Citi.gCAZy Search 

Introduction to Carbohydrate- Binding 

Binding Module Families Modules 



Last updated on 
2003 Apr 1 



Introduction 



A carbohydrate-binding module (CBM) is defined as contiguous amino acid sequence within a 
carbohydrate-active enzyme with a discreet fold having carbohydrate-binding activity . A few 
exceptions are CBMs in cellulosomal scaffoldin proteins and rare instances of independent 
putative CBMs. The requirement of CBMs existing as modules within larger enzymes sets this 
class of carbohydrate-binding protein apart from other non-catalytic sugar binding proteins such 
as lectins and sugar transport proteins. 



CBMs were previously classified as cellulose-binding domains (CBDs) based on the initial 
discovery of several modules that bound cellulose [1,2]. However, additional modules in 
carbohydrate-active enzymes are continually being found that bind carbohydrates other than 
cellulose yet otherwise meet the CBM criteria, hence the need to reclassify these polypeptides 
using more inclusive terminology. 

Family classification 

Previous classification of cellulose-binding domains were based on amino acid similarity [3]. 
Groupings of CBDs were called "Types" and numbered with roman numerals (e.g. Type I or Type 
II CBDs). In keeping with the glycoside hydrolase classification, these groupings are now called 
families and numbered with Arabic numerals. Families 1 to 13 are the same as Types I to XIII [3]. 



The family classification of CBMs is expected to: 

1. aid in the identification of CBMs 

2. in some cases, predict binding specificity 

3. aid in identifying functional residues 

4. reveal evolutionary relationships 

5. be predictive of polypeptide folds 
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This document provides an updated list of the CBM-containing proteins, arranged by CBM 
family. When a protein contains CBDs from different families, it appears in each appropriate 
family. Because the fold of proteins is better conserved than their sequences, some of the CBM 
families are likely to form superfamilies or clans. 



References 

1. Tomme, P. & Claeyssens, M. (1989) Identification of the functional important carboxylgroup in 
cellobiohydrolase I from Trichoderma reesei : a chemical modification study. FEBS Lett. 243, 
239-243 

2. Gilkes, N.R., Warren, R.A., Miller, R.C.J. & Kilburn, D.G. (1988) Precise excision of the 
cellulose binding domains from two Cellulomonas fimi cellulases by a homologous protease and 
the effect on catalysis. J.Biol.Chem. 263, 10401-10407 

3. Tomme, P., Warren, R.A., Miller, R.C., Jr., Kilburn, D.G. & Gilkes, N.R. (1995) in Enzymatic 
Degradation of Insoluble Polysaccharides (Saddler, J.N. & Penner, M., eds.), Cellulose-binding 
domains: classification and properties, pp. 142-163, American Chemical Society, Washington 

Note 

The GenBank, SWISS-PROT, and/or PDB entries for these proteins are therefore mentioned in 
each of the relevant list of family members. 



Please send your comments to Bernard Henrissat, Pedro Coutinho or Emeline 

Deleury 
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CAZy - Carbohydrate- Active 

enZYmes 

Access by Access by 
Home Family Organism Acknowledgements Ljnks Team CitingCAZy Search 

Family CBM3 |Famii y cBM3 i^j gp| 



CAZy Family Carbohydrate- Binding Module Family 3 

Description Carbohydrate-binding module 3. Modules of approx. 150 residues found in bacterial 

enzymes. The cellulose-binding function has been demonstrated in many cases. In one 
instance binding to chitin has been reported. 

3D Structure Status Available (see PDB). Fold P-sandwich 

Note Previously known as cellulose-binding domain family III (CBD III). 

Relevant Links HOMSTRAD; InterPro; PFAM 

Statistics CAZy(62); GenBank/GenPept (71 ); Swissprot (45); PDB (1 0); 3D(4) 



Protein 


Organism 


EC# 


GenBank / GenPept 


Swiss P rot 


scaffoldin CipV 


Acetivibrio cellufofyticus 


n.d. 


AF155197AAF06064.1 


Q9RPL0 


cellulase CelA 


Anaerocellum 


3.2.1.91 


Z86105 CAB06786.1 


P96311 




thermophilum 


3.2.1.4 




endoglucanase A (EngA) 


Bacillus 

amyloliquefaciens 
UMAS1002 


n.d. 


AF363635 AAL99668.1 




endo-1,4-glucanase 3a 


Bacillus lautus. 


3.2.1.4 


A28175 CAA01935.1 




ORF1 


Bacillus lautus 


n.d. 


M76588 AAA22302.1 


P29718 


endoglucanase 


Bacillus pumilus S-27 


3.2.1.4 


AF20671 6 AAF1 5367.1 




endo-1 ,4-glucanase 


Bacillus sp. 5H 


3.2.1.4 


AB016164 BAA31712.1 


083012 


CelS 


Bacillus sp. 79-23 


3.2.1.4 


AF045482 AAC02536.1 


052731 


endoglucanase CelB 


Bacillus sp. BP23 


3.2.1.4 


AJ133614CAB38941.1 


Q9Z4I1 


endo-1 ,4-glucanase 


Bacillus sp. D04 


3.2.1.4 


U27084 AAC43478.1 


Q45430 


endo-1 ,4-glucanase IV 


Bacillus sp. KSM-522 


3.2.1.4 


AB004098 BAA24918.1 


050589 

P28622 


endo-1 ,4-glucanase 


Bacillus subtilis 


3.2.1.4 


M28332 AAA22307.1 


Q45532 


endo-1 ,4-glucanase 


Bacillus subtilis BSE616 


3.2.1.4 


D01057 BAA00859.1 


P23549 


endo-1 ? 4-P-glucanase 


Bacillus subtilis CHZ1 


3.2.1.4 


AY044252 AAK94871.1 




endo-1 ,4-glucanase 


Bacillus subtilis DIG 


3.2.1.4 


M16185AAA22496.1 


P07983 


endo-1, 4-glucanase (EglS or 


Bacillus subtilis subsp. 


3.2.1.4 


AF355629 AAK39540.1 


P10475 


BgIC) 


subtilis str. 168 




X04689 CAA28392.1 
X67044 CAA47429.1 
Z29076 CAA82317.1 
273234 CAA9761 0.1 
Z991 1 3 CAB1 3696.1 
NC_000964 NP_389695.1 


scaffoldin CipBc 


Bacteroides 


none 


AF224509 AAG01 230.1 


Q9FDJ9 


P-1,4-mannanase (ManA) 


cellulosolvens 






CaldibaciHus 


3.2.1.78 


AF1 63837 AAF22274.1 


Q9RFX5 




cellulovorans 





PDB/ 
3D 
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ORF (fragment) 




Caldibacillus 
cellulovorans 


n.d. 


AF200304 AAF61 648.1 


ORF1 (fragment) 




Caldibacillus 
cellulovorans 


n.d. 


AF163837AAF22273.1 


xylanase XynA 




Caldibacillus 


3.2.1.8 


AF200304 AAF61 649.1 


P-mannanase/endo-1 ,4- 




cellulovorans 








Caldicellulosiruptor 


3.2.1.4 


L01257 AAA71 887.1 


glucanase 




saccharolyticus 


3.2.1.78 




CelA 




Caldicellulosiruptor 
saccharolyticus 


3.2.1.4 


L32742AAA91 086.1 


endo-1 ,4- 




Caldicellulosiruptor 


3.2.1.91 


X13602 CAA31 936.1 


glucanase/cellobiohydrolase 




saccharolyticus 


3.2.1.4 




xynC (multidomain protein) 




Caldicellulosiruptor sp. 
Rt69B.1 


3.2.1.8 


AF036924 AAB95326.1 


CelB (multidomain protein) 




Caldicellulosiruptor sp. 
Tok7B.1 


n.d. 


AF078737AAD30364.1 


ORF5 




Caldicellulosiruptor sp. 
Tok7B.1 


n.d. 


AF078038 AAK06388.1 


xylanase/arabinosidase XynA 




Caldicellulosiruptor sp. 
Tok7B.1 


3.2.1.8 
3.2.1.55 


AF078737AAD30363.1 


endo-1 ,4-glucanase B 


Cel9A 


Cellulomonas fimi 


3.2.1.4 


M64644 AAA9^nflfi 1 


ORF CAC0910 




Clostridium 
acetobutylicum ATCC 
824 


n.d. 


AE007606 AAK78886.1 


ORF CAC0913 




Clostridium 
acetobutylicum ATCC 
824 


n.d. 


AE007607AAK78889.1 


ORF CAC0916 




Clostridium 
acetobutylicum ATCC 
824 


n.d. 


AE007607 AAK78892.1 


cellulase H (CelH) 


Cel9H 


Clostridium 
cellulolyticum 


3.2.1.4 


AF316823 AAG45157.1 


cellulase J (CelJ) 


Cel9J 


Clostridium 
cellulolyticum 


3.2.1.4 


AF31 6823 AAG451 58.1 


endo-1 ,4-glucanase G 


Cel9G 


Clostridium 


3.2.1.4 


M87018AAA73868.1 


(CelCCG) 




cellulolyticum 





scaffoldin CipC 

endo-1, 4-glucanase H 
(fragment) 

scaffoldin CbpA 

scaffoldin CipA 

cellobiohydrolase Y (avicelase 
II) (CelY) 

endo-1 ,4-glucanase Z 
(avicelase I) (CelZ) 
endoglucanase Q (CelQ) 

scaffoldin CipA 

ORFZ 

1,4-P-cellobiohydrolase (CelO) 
cellobiohydrolas A (CbhA) 
endo-1 ,4-glucanase (CeIN) 



Clostridium none 
cellulolyticum 

Clostridium cellulovorans 3.2.1.4 

Clostridium cellulovorans none 
Clostridium josui none 
Clostridium stercorarium 3.2.1.91 

Clostridium stercorarium 3.2.1 .4 

Cel9l Clostridium 3.2.1 .4 

thermocellum 

Clostridium thermocellum none 
ATCC 27405 

Clostridium thermocellum n.d. 
DSM 1237 

Clostridium thermocellum 3.2.1.91 
F7 

Clostridium thermocellum 3.2.1.91 
F7 

Clostridium thermocellum 3.2.1.4 
F7 



U40345AAC28899.1 

U34793 AAC38572.2 

M73817AAA23218.1 
AB004845 BAA32429.1 
Z69359 CAA93280.1 

X55299 CAA39010.1 

AB047845 BAB33148.1 



Z68924 CAA93151.2 
AJ420770 CAD1 2660.1 
AJ275975 CAB76938.1 

X80993 CAA56918.1 

AJ275974 CAB76935.1 



P22533 
P22534 
P10474 
052374 

Q9X3P6 

Q9X3P5 

P26225 

Q977Y4 

Q97KK5 

Q97KK3 

Q9EYQ5 
Q9EYQ4 

P37700 1G87 
1GA2 
1K72 
1KFG 

Q45996 1G43A 

065987 
P38058 

P50900 
P23659 

Q06851 1NBCA 
Q46392 

Q59325 
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endo-1 ,4-glucanase F 

endo-1 ,4-glucanase I (Cell or 
CMC) 

endo-1 ,4-glucanase N 

endo-1 ,4-glucanase V1 

endo-1, 4-glucanase V 

endo-1, 4-glucanase CelA 
(fragment) 

cellulose-binding protein E1 
(Cbpel) 

endo-1 ,4-glucanase A 
cellobiohydrolase Cel48C 
cellulase VI (Cel9A) 

ORF SCF11.15 

endo/exo-1 ,4-glucanase E4 



endo-1 ,4-glucanase 

Sequence from patent 

Sequence 10 from patent US 
6429000 



Cel9B Clostridium th rmocellum 
NCIB 10682 

Cel9C Clostridium thermocellum 
NCIB10682/F7 
Erwinia carotovora 
atroseptica 
Erwinia carotovora 
SCC3193 

Erwinia carotovora 
SCRI193 

Erwinia carotovora 
subsp. carotovora LY34 
Eubacterium 
cellulosolvens 
Paenibacillus lautus 
Cel48C Paenibacillus sp. BP-23 
Cel9A Ruminococcus albus F- 
40 

Streptomyces coelicolor 
A3(2) 

Cel9A Thermobiftda fusca 



Thermophilic anaerobe 3.2.1 .4 
NA10 



Unknown 
Unknown. 



3.2.1.4 X60545 CAA43035.1 P26224 

3.2.1 .4 L04735 AAA20892.1 Q02934 

AJ275974CAB76932.1 
3.2.1 .4 L39788 AAC37033.1 Q59394 

3.2.1.4 X79241 CAA55823.1 Q59395 

3.2.1.4 X76000CAA53592.1 Q47096 

3.2.1.4 AF025768AAC02964.1 031029 

n.d. AB072270 BAB86305.1 

3.2.1 .4 M76588 AAA22303.1 P29719 
3.2.1 .91 AJ488933 CAD32945.1 
ad. AB028321 BAB64431.1 

n.d. AL1 32662 CAB59592.1 
NC_003888 NP_624849.1 
3.2.1.4 L20093AAB42155.1 P26221 1JS4A 
3.2.1.91 Q08167 1TF4A 

3TF4A 
4TF4A 

AB008029 BAA22939.1 024820 

none -AAE1 6382.1 

-AAO30719.1 
n.d. AAN26180.1 



Last updated on 2003 Apr 1 
© Copyright 1998-2003 AFMB-CNRS 

Please send your comments to Bernard Henrissat, Pedro Coutinho or Emeline 

Deleury 
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CAZy - Carbohydrate-Active 

enZYmes 

Access t)y Access bv 
Home Family Organism Acknowledgements ^ Team CitingCAZy Search 

Family CBM2 | Family cbm2 ^j go] 



CAZy Family Carbohydrate-Binding Module Family 2 

Description Carbohydrate-binding module 2. Modules of approx. 100 residues and which are found 
in a large number of bacterial enzymes. The cellulose-binding function has been 
demonstrated in many cases. Several of these modules have been shown to also bind 
chitin or xylan. 

3D Structure Status Available (see PDB) 

Note Previously known as cellulose-binding domain family II (CBD II). 

Relevant Links HOMSTRAD; InterPro; PFAM 

Statistics CAZy(107); GenBank/GenPept (143); Swissprot (83); PDB (10); 3D(1 1) 



Protein 




Organism 


EC# 


GenBank/GenPept 


SwissProt 


PDB/ 
3D 


endo-1 ,4-glucanase 




Acidothermus cellulolyticus 


3.2.1.4 


U33212AAA75477.1 


P54583 


3D 










AX467594CAD42489.1 






endo-1 ,4-glucanase 




Actinomyces sp. 40 


3.2.1.4 


U94825 AAC06196.1 


066064 




ORF PCZA361.11 




Amycolatopsis orientalis 


n.d. 


AJ223998 CAA1 1769.1 


052799 




(putative oxidoreductase) 








ORF egl 




Azorhizobium caulinodans 


n.d. 


Z48958 - 










ORS571 








BA_0961 




Bacillus anthracis str. A2012 


n.d. 


NC_003995 NP_654321.1 






chitinase CW (ChiCW) 




Bacillus cereus 28-9 


3.2.1.14 


AF416570 AAM48520.1 






chitinase B 




Bacillus cereus CH 


3.2.1.14 


AB041932 BAB1 6891.1 


Q9FAC8 




chitinase 




Bacillus thurlngiensis 


3.2.1.14 


U89796 AAB58579.1 


007088 




chitinase (Chi) 




Bacillus thuringiensis 


3.2.1.14 


AF424979 AAL17867.1 






chitinase 








AY074882 AAL71 886.2 








Bacillus thuringiensis serovar 


n.d. 


AF526379 AAM88400.1 










israelensis 








chitinase (Kchi) 




Bacillus thuringiensis serovar 


n.d. 


AY189740 AA034713.1 










kurstaki 








chitinase 




Bacillus thuringiensis serovar 
sotto 


n.d. 


AY129671 AAM94024.1 






endo-1 ,4-glucanase 1 




Butyhvibrio fibrisolvens H17c 


3.2.1.4 


-AAE29745 


P20847 












X17538CAA35574.1 






cellobiohydrolase A 


Cel6B 


Cellulomonas fimi 


3.2.1.91 


L25809 AAC36898.1 


P50401 




cellobiohydrolase B 


Cel48A 


Cellulomonas fimi 


3.2.1.91 


L38827 AAB00822.1 


P50899 




endo-1 ,4-glucanase A 


Cel6A 


Cellulomonas fimi 


3.2.1.4 


M15823 AAA23084.1 


P07984 




endo-1, 4-glucanase B 


Cel9A 


Cellulomonas fimi 


3.2.1.4 


M64644 AAA23086.1 


P26225 




endo-1 ,4-glucanase D 


CelSA 


Cellulomonas fimi 


3.2.1.4 


L02544 AAA23089.1 


P50400 




xylanase (Cex) - 


XynlOA Cellulomonas fimi 


3.2.1.8 


M15824 AAA56791.1 


P07986 


1EXG 


xylanase D 








L11080AAA56792.1 


Q59277 


1EXH_ 


Xyn11A Cellulomonas fimi 


3.2.1.8 


X76729CAA54145.1 


P54865 


1E5BA 
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3.1.1.72 1E5CA 

1HEHC 
1HEJC 
1XBD_ 
2XBD 



endo-1, 4-glucanase B 


Cellulomonas flavtgena 


3.2.1.4 


AF1 72345 AAD48494.2 












xyianase (fragment) 


Cellulomonas flavigena 


3.2.1.8 


AF338352 AAK1 5536.1 


Q9AG99 




CDBB5321 








xylanase 


Xyn11A Cellulomonas pachnodae 


3.2.1.8 


AF120156 AAD54767.1 


Q9RQB8 


chitinase 63 (Chi63) 


Cellulomonas $p. GM13 


3.2.1.14 


AF181718AAF00931.2 




endo-b1 ,4-mannanase 
5B 


Man5B Cellvibrib japonicus 


n.d. 


AY187032AAO31760.1 




arabinofuranosidase C 


CelMbrio japonicus 


3.2.1.55 


X54523 CAA38390.1 


P23031 




(formerly Pseudomonas 










cel/ulosa) 








cellodextrinase C 


Cellvibrio japonicus 


3.2.1. - 


X61299 CAA43597.1 


P27033 




(formerly Pseudomonas 










cellulosa) 








endo-1 ,4-glucanase A 


Cellvibrio japonicus 


3.2.1.4 


X12570 CAA31 082.1 


P10476 




(formerly Pseudomonas 










cellulosa) 








endo-1 ,4-glucanase B 


Cellvibrio japonicus 


3.2.1.4 


X52615 CAA36844.1 


P18126 




(formerly Pseudomonas 










cellulosa) 








endo-1,4-glucanase E 


Cellvibrio japonicus 


3.2.1.4 


X86798 CAA60493.1 






(formerly Pseudomonas 








cellulosa) 








esterase D 


Cellvibrio japonicus 


n.d. 


X58956 CAA41 727.1 


Q51815 




(formerly Pseudomonas 








cellulosa) 








pectate lyase (PelA) 


PeHOA Cellvibrio japonicus 


4.2.2.2 


AF279264 AAG29353 1 


Q9F7L3 




(formerly Pseudomonas 










cellulosa) 








rhamnogalacturonan 


Rgl11A Cellvibrio japonicus 


4.2.2.- 


AY026755 AAK2091 1 .1 


Q9AF09 


lyase 


(formerly Pseudomonas 








cellulosa) 








xylanase A 


XynWA Cellvibrio japonicus 


3.2.1.8 


X15429 CAA33469.1 


P14768 




(formerly Pseudomonas 








cellulosa) 








xylanase B 


Cellvibrio japonicus 


3.2.1.8 


X54523 CAA38389.1 


P23030 




(formerly Pseudomonas 








cellulosa) 








l*l II U I loot/ VV./HU- | 


ksMoreiia virus CVK2 


3.2.1.14 


AB022343 BAA78554.1 


Q9WAX4 


/^nlli /%,IA 

Cellulase CelA 


Cfavibacter michiganensis 


3.2.1.4 


X62582 CAA44467.2 


Q9K5C7 












cellulase CelA 


Clavibacter michiganensis 


3.2.1.4 


AY007311 AAK1 6222.1 


Q9AF65 




subsp. sepedonicus 








endo-1 ,4-glucanase D 


Clostridium cellulovorans 


3.2.1.4 


M37434 AAA23233.1 


P28623 


endo-1 ,4-glucanase 


Clostridium tongisporum 


3.2.1.4 


L02868 AAC37035.1 


P54937 


(CelA) 






CMC-xylanase 


Fibrobacter succinogenes 


3.2.1.4 


U94826 AAC06197.1 


066065 


(fragment) 


S85 


3.2.1.8 




endoglucanase 1 (Engl) 


Globodera rostochiensis 


3.2.1.4 


AF004523AAC48325.1 


016028 








AF056110 AAC63988.1 




endoglucanase 1 (Engl) 


Heterodera glycines 


3.2.1.4 


AF052733 AAC1 5707.1 


018453 








AF006052 AAC48327.1 




P-1 ,4-endoglucanase 






A79341 CAB59 144.1 




Meloidogyne incognita 


3.2.1.4 


AF323087 AAK21 882.1 


Q9BJZ1 


Eng-1a 










cellulos -binding protein 


Meloidogyne incognita 


none 


AF049139 AAC05133.1 


061530 
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endo-1 ,4-glucanase 1 
(Eng-1) 

ORF X (fragment) 

endo-1,4-giucanase A 
endo-1,4-glucanase 

ORF MT2041 

ORF Rv1987 

endoxylanase (XynA) 
ORF A181/182R 

ORF PF1233 

ORF RSp0583 

chitinase C 
(2SC6G5.20C) 

ORF 2SCC1 3.07c 

ORF AxeA 

ORF CelB (SCG11A.18) 

ORF CelS2 
(SCG11A.19) 
ORF SC10F4.10C 

ORF SC4H2.07C 

ORF SC5C7.30C 

ORF SC5C7.31C 

ORF SC5C7.33 

ORF SC7H1.24 

ORF SCF1 1.34c (ManA) 

ORF SCF91.34C (XlnA) 

ORF XlnB 

chitinase lit (Chi III) 
(fragment) 

cellulose binding protein 
(ORF2) 

endo-1 ,4-glucanase 
xylanase 

acetyl xylan esterase 
chitinase C 
endo-1 ,4-glucanase 
endo-1 ,4-glucanas 
xylanase B 
chitinase C 
chitinase-63 



Meloidogyne incognita 3.2. 1 A 



Methylococcus capsulatus 


n.d. 


Bath 




Microbispora bispora 


3.2.1.4 


Micromonospora 


3.2.1.4 


cellulolyticum 




Mycobacterium tuberculosis 


n.d. 


CDC1551 




Mycobacterium tuberculosis 


n.d. 


H37Rv 




Nonomuraea flexuosa 


3.2.1.8 


Paramecium bursaria 


3.2.1.14 


Chlorella virus 1 




Pvrnrnm i q ft inn cj / q D^M 


n rl 

1 1.U. 


3638 




f\afoit/f net ou/ 01 iooc?o» t/f f * 


n rl 

I I.U. 


GMI1000 




Strf*nfnm\/rf*s mpitcnlnr A*k 


^ 9 1 14 


(2) 




Streptomyces coelicolor A3 


n.d. 


(2) 




Streptomyces coelicolor A3 


n.d. 


(2) 




Streotomvces coelicolor A3 


n.d. 


(2) 




Streotomvces coelicolor A3 


n.d. 


(2) 




Streotomvces coelicolor A3 


n.d. 


(2) 




Streotomvces coelicolor A3 


n.d. 


(2) 




Streotomvces coelicolor A3 


n.d. 


(2) 




Streptomyces coelicolor A3 


n.d. 


(2) 




Stre p tomyces coelicolor A 3 


n.d. 


(2) 




Streptomyces coelicolor A3 


n.d. 


(2) 




Streptomyces coelicolor A3 


n.d. 


(2) 




Streptomyces coelicolor A3 


n.d. 


(2) 




Streptomyces coelicolor A3 


n.d. 


(2) 




Streptomyces griseus HUT 


3.2.1.14 


6037 




Streptomyces halstedii 


n.d. 


oirepiomyces naisieati 


1 O 4 A 


&irepiomyces naisieoii J Mo 


O O A O 
O.^.I.O 


Streptomyces lividans 


3.1.1.72 


Streptomyces lividans 


3.2.1.14 


Streptomyces lividans 


3.2.1.4 


Streptomyces lividans 


3.2.1.4 


Streptomyces lividans 


3.2.1.8 


Streptomyces peucetius 


3.2.1.14 


Streptomyces plicatus 


3.2.1.14 



AF1 00549 AAD45868.1 Q9UA57 



AF447860 AAL40976.1 



P26414 

S76408 AAC60491.1 Q53488 



AE007056 AAK46316.1 
NC__002755 NP_336502.1 

Z74025 CAA98397.1 Q10870 
NC_000962 NP_216503.1 

AJ508952 CAD48747.1 

U42580 AAC96549.2 Q84501 
NC_000852 NP_048529.1 Q84502 

AE010230 AAL81357.1 Q8U1H5 
NC_003413 NP_578962.1 

AL646079 CAD1 7734.1 Q8XS97 
NC_003296 NP_522144.1 

AB017010 BAA75644.1 Q9Z9M8 
AL359152 CAB94547.1 
NC_003888 NP_629515.1 

AL4421 65 CAC1 01 08.1 Q9F3I2 
NC_003888 NP_627029.1 

AL1 33220 CAB61737.1 Q9RKN7 
NCJ)03888 NP_626539.1 

AL1 332 1 0 CAB61 599.1 Q9RJY3 
NC_003888 NP_625477.1 

AL1 33210 CAB61 600.1 Q9RJY2 
NC_003888 NP_625478.1 

AL450350 CAC1 6970.1 Q9F3Q0 
NC_003888NP_631677.1 

AL022268 CAA1 8323.1 069962 
NC_003888 NP_629910.1 

AL031515 CAA20642.1 086727 
NC_003888 NP_630626.1 

AL031515 CAA20643.1 086728 
NC_003888 NP_630627.1 

AL031515 CAA20645.1 086730 
NC_003888 NP_630629.1 

AL021411 CAA16211.1 054183 
NC_003888 NP_630071.1 
AL1 32662 CAB5961 1.1 
AL939106 CAD55266.1 

AL1 32973 CAB61 191.1 Q9RJ91 
NC_003888 NP_624983.1 

AL1 33220 CAB61 738.1 Q9RKN6 
NC_003888 NP_626540.1 
AB081807 BAB86377.1 

U51222 AAC45430.1 007862 



U51222 AAC45429.1 008468 

U41627AAC45554.1 Q59922 1NQ6 

M64552 AAC061 15.2 Q54413 

D12647 BAA021 68.1 P36909 

M82807 - P27035 

U04629 AAB71 950.1 Q54331 3D 

M64552 AAC06114.2 P26515 

AF206633 AAF43629.1 Q9L8G0 

M1 8397 AAA267 17.1 P1 1 220 
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endo-1 ,4-glucanase S 
cellutase 12A 

xylanase B 

acetyl xylan esterase 
STX-III 

xylanase II 

chitin-binding protein 
celS2 

endo-1 ,4-glucanase S1 
xylanase (SvxA) 
ORF slr0897 



Streptomyces rochet A2 
Cel12A Streptomyces sp. 11 AG8 

Streptomyces 

thermocyaneoviolaceus 

Streptomyces 

thermoviolaceus 

Streptomyces 

thermoviolaceus 

Streptomyces viridosporus 

Streptomyces viridosporus 
Streptomyces viridosporus 
Synechocystis sp. PCC 6803 



cellobiohydrolase E3 
endo-1 ,4-glucanase E1 
endo-1 f 4-glucanase E2 
endo-1,4-glucanase E5 
endo/exo-1 ,4-glucanase 
E4 

exocellulase E6 (CelF) Cel48A 

xylanase 

chitinase A 

xylanase 

unnamed protein 
product 

Sequence 2 from patent 
US 6207436 
ORF EngXCA 



Cel6B Thermobifida fusca 

Cel9B Thermobifida fusca 

Cel6A Thermobifida fusca 

Cel5A Thermobifida fusca 

Cel9A Thermobifida fusca 



Thermobifida fusca 
Thermobifida fusca 
Thermococcus 
kodakaraensis 
Thermomonospora alba 
unidentified 



endo-1 ,4-glucanase 

EngXCA 

ORF XF0818 

ORF XF1267 

EngXCA 

GuxA 



Unknown. 

Xanthomonas axonopodis pv. 
citri str. 306 

Xanthomonas campestris 

Xanthomonas campestris pv. 
campestris str, ATCC 33913 
Xylella fastidiosa 9a5c 

Xylella fastidiosa 9a5c 

Xylella fastidiosa Temeculal 

Xylella fastidiosa Temeculal 



M82804 AAA26720.1 
3.2.1.4 X73953CAA52139.1 Q59963 

3.2.1.4 AF233376AAE59731.1 Q9KIH1 

AAF91283.1 
3.2.1.8 AF1 94025 AAF04601.1 Q9RMM4 

3.1.1.6 D85898 BAA19779.1 008347 

3.1.1.72 

3.2.1.8 D85897BAA1 9778.1 008346 



AF1 26376 AAD27623.1 Q9X5K8 



10A4 



AF1 30408 

AF198618 

D64003 
NC_000911.1 

U 18978 

L20094 

M73321 

L01577 

L20093 



none 

3.2.1.4 
3.2.1.8 
n.d. 

3.2.1.91 
3.2.1.4 
3.2.1.4 
3.2.1.4 
3.2.1.4 
3.2.1.91 
3.2.1.91 
3.2.1.8 
3.2.1.14 

3.2.1.8 
n.d. 

n.d. 

n.d. AE011689 

NC_003919 
3.2.1.4 M32700 
AJ304415 
n.d. AE012472 

NC003902 
n,d. AE003921 

NCJ)02488 
n.d. AE003960 

NC 002488 
n.d. AE012560 

NC_004556 
n.d. AE012555 

NC 004556 



AAD25090.1 

AAF09501.1 

BAA10447.1 
NP_442377.1 

AAA62211.1 

AAC06387.1 

AAC06388.1 

AAC09379.1 

AAB42155.1 



AF144563 AAD39947.1 
U01242AAA21480.1 
T43916 

Z81013CAB02654.1 
AX565635 CAD58415.1 



Q55365 

Q60029 
Q08166 
P26222 
Q01786 
P26221 
Q08167 

Q56265 



P74912 



AAE63612.1 

AAM35501.1 

NP_640965.1 

AAA27612.1 P19487 
CAC1 8529.1 
AAM42791.1 

NP_638867.1 

AAF83628.1 Q9PF60 
NP_298108.1 

AAF84076.1 Q9PDW2 
NP_298556.1 

AA029683.1 

NP_780034.1 

AAO28402.1 

NP 778753.1 



3D 
3D 
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We are working hard at putting together and maintaining this server for the scientific community. 
If you found this server useful for your research we would appreciate if you cite CAZy(ModO). 
There are several ways to cite CAZy(ModO) (see below). 

1 . Mention this server name and URL: 

Coutinho, P.M. & Henrissat, B. (1999) Carbohydrate-Active Enzymes server at 
URL: http://afinb.cnrs-mrs.fr/~cazy/CAZY/index.html 

2. Quote papers where CAZy(ModO) is revealed: 

Coutinho, P.M. & Henrissat, B. (1999) Carbohydrate-active enzymes: an integrated database 
approach. In "Recent Advances in Carbohydrate Bioengineering", HJ. Gilbert, G. Davies, B. 
Henrissat and B. Svensson eds., The Royal Society of Chemistry, Cambridge, pp. 3-12. 

Coutinho, P.M. & Henrissat, B. (1999) The modular structure of cellulases and other 
carbohydrate-active enzymes: an integrated database approach. In "Genetics, Biochemistry and 
Ecology of Cellulose Degradation 11 ,, K. Ohmiya, K. Hayashi, K. Sakka, Y. Kobayashi, S. Karita 
and T. Kimura eds., Uni Publishers Co., Tokyo, pp. 15-23. 

3. Quote the references concerning the family classification(s) in 
question: 

Glycosidases and Transglycosidases (or Glycoside Hydrolases) 

Glycosyltransferases 

Polysaccharide Lyases (cite CAZyModO) 
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Swiss-Prot: P22533 



NiceProt - a user-friendly view of this Swiss-Prot entry 



ID 
AC 
DT 
DT 
DT 
DE 
DE 
DE 
GN 
OS 
OC 
OC 
OX 
RN 
RP 
RX 
RA 
RT 
RT 
RL 
RN 
RP 
RX 
RA 
RT 
RT 
RT 
RL 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 



MANB_CALSA 
P22533; 
01-AUG-1991 
01-JUL-1993 
28-FEB-2003 



STANDARD; 



PRT; 1331 AA. 



(Rel. 19, Created) 
(Rel. 26, Last sequence update) 
(Rel. 41, Last annotation update) 
Beta-mannanase/endoglucanase A precursor [Includes: Mannan endo-1,4- 
beta-mannosidase A (EC 3.2.1.78 ) (Beta-mannanase) (Endo-1,4- 
mannanase) ; Endo-1, 4-beta-glucanase (EC 3.2.1.4 ) (Cellulase) ] . 
MAN A. 

Caldocellum saccharolyticum (Caldicellulosiruptor saccharolyt icus ) . 
Bacteria ; Firmicutes ; Clostridia ; Clostridiales ; Syntrophomonadaceae ; 
Caldicellulosiruptor . 
NCB I TaxID= 44001 ; 
[1] 

SEQUENCE FROM N.A. 

MEDLINE=93119139; PubMed=1476429; [ NCBI , ExPASy , EBI , 
Gibbs M.D. , Saul D. J. , Luthi E . , Bergquist P.L. ; 



'Caldocellum saccharolyticum 1 
58:3864-3867 (1992) . 



Israel , Japan ] 
is part of a 



"The beta-mannanase from 
multidomain enzyme . " ; 
Appl . Environ. Microbiol 
[2] 

SEQUENCE OF 1-346 FROM N.A. 

MEDLINE=91247819; PubMed=203 9230 ; [ NCBI , ExPASy , EBI , Israel , Japan ] 
Luethi E . , Jasmat N . B . , Grayling R. A. , Love D . R . , Bergquist P.L. ; 
"Cloning, sequence analysis, and expression in Escherichia coli of a 
gene coding for a beta-mannanase from the extremely thermophilic 
bacterium 'Caldocellum saccharolyticum'."; 
Appl. Environ. Microbiol. 57:694-700(1991). 

-!- FUNCTION: DEGRADATION OF HEMICELLULOSES , THE SECOND MOST ABUNDANT 

POLYSACCHARIDES IN NATURE. CONTAINS TWO CATALYTIC DOMAINS WITH 

MANN ANAS E AND ENDOGLUCANASE ACTIVITIES. 
-!- CATALYTIC ACTIVITY: Random hydrolysis of 1 , 4 -beta-D-mannosidic 

linkages in mannans, galactomannans , glucomannans, and 

galactoglucomannans . 

CATALYTIC ACTIVITY: Endohydrolysis of 1 , 4 -beta-D-glucosidic 
linkages in cellulose, lichenin and cereal beta-D-glucans . 
MISCELLANEOUS: THIS ENZYME IS MOST ACTIVE AT PH 6 AND 80 DEGREES 
CELSIUS . 

SIMILARITY: IN THE N- TERMINAL SECTION; BELONGS TO CELLULASE FAMILY 
A (FAMILY 5 OF GLYCOSYL HYDROLASES ) . 

SIMILARITY: IN THE C- TERMINAL SECTION; BELONGS TO CELLULASE FAMILY 
J (FAMILY 44 OF GLYCOSYL HYDROLASES) . 

This SWISS-PROT entry is copyright. It is produced through a collaboration 
between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 
the European Bioinf ormatics Institute. There are no restrictions on its 
use by non-profit institutions as long as its content is in no way 
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SQ 



modified and this statement is not removed. Usage by and for commercial 
entities requires a license agreement (See http : //www. isb- sib . ch/announce/ 
or send an email to license®isb-sib. ch) . 



[ EMBL / GenBank / DDBJ ] 
[ EMBL / GenBank / DDBJ] 



[ CoDingSeguence ] 
[ CoDingSequence ] 



EMBL; L01257; AAA71887.1; 
EMBL; M36063; AAA72861.1; 
PIR; B43745; B43745. 
PIR; A48954; A48954 . 

HSSP; Q06851; 1NBC. [ HSSP ENTRY / SWISS -3DIMAGE / PDB ] 

InterPro; IPR001956; CBD 3. 

InterPro; IPR0Q1547; Glyco hydro 5. 

InterPro ; Graphical view of domain structure. 

Pfam; PF00150; cellulase; 1. 

Pfam; PF00942; CBM 3; 2. 

ProDom; PD0Q1947; CBD 3; 2. 

ProDom [ Domain structure / List of seq. sharing at least 1 domain ] 

PRQSITE; PS00659; GLYCOSYL HYDROL F5 ; 1. 

BLOCKS; P22 533. 

ProtoNet; P22533. 

ProtoMap; P22533. 

PRESAGE; P22533. 

DIP; P22533. 

ModBase; P22533. 

SWISS -2DPAGE; GET REGION ON 2D PAGE . 

Hydrolase ; Glycosidase ; Cellulose degradation ; Signal ; 



1 


41 


42 


1331 


42 


325 


326 


361 


362 


518 


519 


564 


565 


720 


721 


780 


781 


1331 


162 


162 


257 


257 


338 


338 


340 


346 



POTENTIAL . 

BETA - MANNANASE / ENDOGLUCANASE A. ^ 
CATALYTIC (MANNANASE ACTIVITY) . GM5 
PRO/SER/THR-RICH (PT BOX) . ->V tT 

SUBS TRATE - BINDING, (POTENTIAL) . C^Bj> X** 
PRO/ S ER/THR - RI CH ( PT BOX) . 

SUBSTRATE-BINDING (PO TENTIAL) . C,Q])TJ1 
PRO/SER/THR-RICH (PT BOX) . 
CATALYTIC (ENDOGLUCANASE ACTIVITY) . 
PROTON DONOR (BY SIMILARITY) . 
NUCLEOPHILE (BY SIMILARITY) . 
T -> P (IN REF. 2) . 
TPTPTPT -> RQHQHRQ (IN REF. 2) . 
MW; FFBCA51BB8D8F0E0 CRC64 ; 



SIGNAL 
CHAIN 
DOMAIN 
DOMAIN 
DOMAIN 
DOMAIN 
DOMAIN 
DOMAIN 
DOMAIN 
ACT_SITE 
ACT_SITE 
CONFLICT 
CONFLICT 

SEQUENCE 1331 AA; 146892 
MRLKTKIRKK WLSVLCTWF LLNILFIANV TILPKVGAAT SNDGWKIDT STLIGTNHAH 
CWYRDRLDTA LRGIRSWGMN SVRWLSNGY RWTKIPASEV ANIISLSRSL GFKAIILEVH 
DTTGYGEDGA ACSLAQAVEY WKEIKSVLDG NEDFVIINIG NEPYGNNNYQ NWVNDTKNAI 
KALRDAGFKH TIMVDAPNWG QDWSNTMRDN AQSIMEADPL RNLVFSIHMY GVYNTASKVE 
EYIKSFVDKG LPLVIGEFGH QHTDGDPDEE AIVRYAKQYK IGLFSWSWCG NSSYVGYLDM 
VNNWDPNNPT PWGQWYKTNA IGTSSTPTPT STVTPTPTPT PTPTPTVTAT PTPTPTPVST 
PATSGQIKVL YANKETNSTT NTIRPWLKW NSGSSSIDLS RVTIRYWYTV DGERAQSAIS 
DWAQIGASNV TFKFVKLSSS VSGADYYLEI GFKSGAGQLQ PGKDTGEIQM RFNKDDWSNY 
NQGNDWSWIQ SMTSYGENEK VTAYIDGVLV WGQEPSGATP APAPTATPTP TPTVTPTPTV 
TPTPTVTATP TPTPTPTPTP VSTPATGGQI KVLYANKETN STTNTIRPWL KWNSGSSSI 
DLSRVTIRYW YTVDGERAQS AISDWAQIGA SNVTFKFVKL SSSVSGADYY LEIGFKSGAG 
QLQPGKDTGE IQIRFNKSDW SNYNQGNDWS WIQSMTSYGE NEKVTAYIDG VLVWGQEPSG 
TTPSPTSTPT VTVTPTPTPT PTPTPTPTVT PTPTVTPTPT VTATPTPTPT PIPTVTPLPT 
ISPSPSWEI TINTNAGRTQ ISPYIYGANQ DIEGWHSAR RLGGNRLTGY NWENNFSNAG 
NDWYHSSDDY LCWSMGISGE DAKVPAAWS KFHEYSLKNN AYSAVTLQMA GYVSKDNYGT 
VSENETAPSN RWAEVKFKKD APLSLNPDLN DNFVYMDEFI NYLINKYGMA SSPTGIKGYI 
LDNEPDLWAS THPRIHPNKV TCKELIEKSV ELAKVIKTLD PSAEVFGYAS YGFMGYYSLQ 
DAPDWNQVKG EHRWFISWYL EQMKKASDSF GKRLLDVLDL HWYPEARGGN IRVCFDGEND 
TSKEWIARM QAPRTLWDPT YKTSVKGQIT AGENSWINQW FSDYLPIIPN VKADIEKYYP 
GTKLAISEFD YGGRNHISGG IALADVLGIF GKYGVNFAAR WGDSGSYAAA AYNIYLNYDG 
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